Question 1

What is Azure AI Search?

Accepted Answer

Azure AI Search provides a dedicated search engine and persistent storage of your searchable content for full text and vector search scenarios. It also includes optional, integrated AI to extract more text and structure from raw content, and to chunk and vectorize content for vector search.

Question 2

How do I work with Azure AI Search?

Accepted Answer

The primary workflow is create, load, and query an index. Although you can use the Azure portal for most tasks, Azure AI Search is intended to be used programmatically, handling requests from client code. Programmatic support is provided through REST APIs and client libraries in .NET, Python, Java, and JavaScript SDKs for Azure.

Question 3

Are "Azure Search" and "Azure Cognitive Search" and "Azure AI Search" the same product?

Accepted Answer

Azure Search was renamed to Azure Cognitive Search in October 2019 to reflect the expanded (yet optional) use of cognitive skills and AI processing in service operations. Azure Cognitive Search was renamed to Azure AI Search in October 2023 to align with Azure AI services.

Question 4

What languages are supported?

Accepted Answer

For vectors, the embedding models you use determines the linguistic experience.

For nonvector strings and numbers, the default analyzer used for tokenization is standard Lucene, which is language agnostic. Otherwise, language support is expressed through language analyzers that apply linguistic rules to inbound (indexing) and outbound (queries) content. Some features, such as speller and query rewrite, are limited to a subset of languages.

Question 5

How do I integrate search into my solution?

Accepted Answer

Client code should call the Azure SDK client libraries or REST APIs to connect to a search index, formulate queries, and handle responses. You can also write code that builds and refreshes an index, or runs indexers programmatically or by script.

Question 6

Is there functional parity across the various APIs?

Accepted Answer

Not always. The REST API is always the first to implement new features in preview API versions. The client libraries in Azure SDKs will pick up new features over time, but are released on their own schedule.

Although the REST APIs are first out with newest features, the Azure SDKs provide more coding support, and are recommended over REST unless a required feature is unavailable.

Question 7

Can I pause the service and stop billing?

Accepted Answer

You can't pause a search service. In Azure AI Search, computing resources are allocated when the service is created. It's not possible to release and reclaim those resources on-demand.

Question 8

Can I upgrade or downgrade the service?

Accepted Answer

Services created before April 2024 in select regions can be upgraded to higher capacity clusters. Downgrading your service isn't supported.

To get more capacity, you can also switch to a higher pricing tier. Your region can't have capacity constraints on the higher tier, and you can only move up between Basic and Standard (S1, S2, and S3) tiers, such as going from Basic to S1. Currently, you can't switch to a lower tier.

Question 9

Can I rename or move the service?

Accepted Answer

Service name and region are fixed for the lifetime of the service.

Question 10

If I migrate my search service to another subscription or resource group, should I expect any downtime?

Accepted Answer

As long as you follow the checklist before moving resources and make sure each step is completed, there shouldn't be any downtime.

Question 11

Why do I see different storage limits for same-tier search services?

Accepted Answer

Storage limits can vary by service creation date. In most supported regions, newer services have higher storage limits than older services, even if they're on the same tier. However, you might be able to upgrade your old service to access the new limits.

Question 12

What does "indexing" mean in Azure AI Search?

Accepted Answer

It refers to the ingestion, parsing, and storing of textual content and tokens that populate a search index. Indexing creates inverted indexes and other physical data structures that support information retrieval.

It creates vector indexes if the schema includes vector fields.

Question 13

Can I move, backup, and restore indexes?

Accepted Answer

There's no native support for porting indexes. Search indexes are considered downstream data structures, accepting content from other data sources that collect operational data. As such, there's no built-in support for backing up and restoring indexes because the expectation is that you would rebuild an index from source data if you deleted it, or wanted to move it.

However, if you want to move an index between search services, you can try the index-backup-restore sample code in this Azure AI Search .NET sample repo. There's also a Python version of backup and restore.

Question 14

Can I restore my index or service once it's deleted?

Accepted Answer

No, if you delete an Azure AI Search index or service, it can't be recovered. When you delete a search service, all indexes in the service are deleted permanently.

Question 15

Can I index from SQL Database replicas?

Accepted Answer

If you're using the search indexer for Azure SQL Database, there are no restrictions on the use of primary or secondary replicas as a data source when building an index from scratch. However, refreshing an index with incremental updates (based on changed records) requires the primary replica. This requirement comes from SQL Database, which guarantees change tracking on primary replicas only. If you try using secondary replicas for an index refresh workload, there's no guarantee you get all of the data.

Question 16

What is vector search?

Accepted Answer

Vector search is a technique that finds the most similar documents by comparing their vector representations. Since the goal of a vector representation is to capture the essential characteristics of an item in a numerical format, vector queries can identify similar content even if there are no explicit matches based on keywords or tags. When a user performs a search, the query is summarized into a vector representation and the vector search engine identifies the most similar documents. To improve efficiency on large databases, vector search often provides the approximate nearest neighbors for a query vector. See Vector search overview for the specifics of Azure AI Search's vector offering.

Question 17

Does Azure AI Search support vector search?

Accepted Answer

Azure AI Search supports vector indexing and retrieval. It can chunk and vectorize query strings and content if you use integrated vectorization and take a dependency on indexers and skillsets.

Question 18

How does vector search work in Azure AI Search?

Accepted Answer

With standalone vector search, you first use an embedding model to transform content into a vector representation within an embedding space. You can then provide these vectors in a document payload to the search index for indexing. To serve search requests, you use the same embedding model to transform the search query into a vector representation, and vector search finds the most similar vectors and return the corresponding documents.

In Azure AI Search, you can index vector data as fields in documents alongside textual and other types of content. There are multiple data types for vector fields.

Vector queries can be issued standalone or in combination with other query types, including term queries and filters in the same search request.

Question 19

Can Azure AI Search vectorize my content or queries?

Accepted Answer

Built-in integrated vectorization is now generally available.

Question 20

Does my search service support vector search?

Accepted Answer

Most existing services support vector search. If you're using a package or API that supports vector search and index creation fails, the underlying search service doesn't support vector search, and a new service must be created. This can occur for a small subset of services created prior to January 1, 2019.

Question 21

Can I add vector search to an existing index?

Accepted Answer

If your search service supports vector search, both existing and new indexes can accommodate vector fields.

Question 22

Why do I see different vector index size limits between my new search services and existing search services?

Accepted Answer

Azure AI Search rolled out improved vector index size limits worldwide for new search services, but some regions experience capacity constraints, and some regions don't have the required infrastructure. New search services created after May 2024 in supported regions should see increased vector index size limits. Alternatively, if you have an existing service in a supported region, you can upgrade your service to access the new limits.

Question 23

Why does my vector index show zero storage?

Accepted Answer

Only vector indexes that use the Hierarchical Navigable Small World (HNSW) algorithm report on vector index size in the Azure portal. If your index uses exhaustive KNN, vector index size is reported as zero, even though the index contains vectors.

Question 24

How do I enable vector search on a search index?

Accepted Answer

To enable vector search in an index, you should:

Add one or more vector fields to a field collection.
Add a "vectorSearch" section to the index schema specifying the configuration used by vector search fields, including the parameters of the Approximate Nearest Neighbor algorithm used, like HNSW.
Use the latest stable version, 2024-07-01, or an Azure SDK to create or update the index, load documents, and issue queries. For more information, see Create a vector index.

Question 25

Where does query execution occur?

Accepted Answer

Queries execute over a single search index that's hosted on your search service. You can't join multiple indexes to search content in two or more indexes, but you can query same-name indexes in multiple search services.

Question 26

Why are there zero matches on terms I know to be valid?

Accepted Answer

The most common case isn't knowing that each query type supports different search behaviors and levels of linguistic analyses. Full text search, which is the predominant workload, includes a language analysis phase that breaks down terms to root forms. This aspect of query parsing casts a broader net over possible matches, because the tokenized term matches a greater number of variants.

Wildcard, fuzzy and regex queries, however, aren't analyzed like regular term or phrase queries and can lead to poor recall if the query doesn't match the analyzed form of the word in the search index. For more information on query parsing and analysis, see query architecture.

Question 27

Why are my wildcard searches slow?

Accepted Answer

Most wildcard search queries, like prefix, fuzzy and regex, are rewritten internally with matching terms in the search index. This extra processing adds to latency. Further, broad search queries, like a* for example, are likely to be rewritten with many terms, which can be slow. For performant wildcard searches, consider defining a custom analyzer.

Question 28

Can I search across multiple indexes?

Accepted Answer

No, a query is always scoped to a single index.

Question 29

Why is the search score a constant 1.0 for every match?

Accepted Answer

Search scores are generated for full text search queries, based on the statistical properties of matching terms, and ordered high to low in the result set. Query types that aren't full text search (wildcard, prefix, regex) aren't ranked by a relevance score. This behavior is by design. A constant score allow matches found through query expansion to be included in the results, without affecting the ranking.

For example, suppose an input of "tour*" in a wildcard search produces matches on "tours", "tourettes", and "tourmaline". Given the nature of these results, there's no way to reasonably infer which terms are more valuable than others. For this reason, term frequencies are ignored when scoring results in queries of types wildcard, prefix, and regex. Search results based on a partial input are given a constant score to avoid bias towards potentially unexpected matches.

Question 30

Where does Azure AI Search store customer data?

Accepted Answer

It stores your data in the geography (Geo) where your service is deployed. Microsoft might replicate your data within the same geo for high availability and durability. For more information, see data residency in Azure.

Question 31

Does Azure AI Search send customer data to other services for processing?

Accepted Answer

Yes, skills and vectorizers make outbound calls from Azure AI Search to other Azure resources or external models that you specify for embedding or chat. Calls to those APIs typically contain raw content to be processed or queries that are vectorized by an embedding model. For Azure-to-Azure connections, the service sends requests over the internal network. If you add a custom skill or vectorizer, the indexer sends content to the URI provided in the custom skill over the public network unless you configure a shared private link.

Question 32

Does Azure AI Search process customer data in other regions?

Accepted Answer

Processing (vectorization or applied AI transformations) is performed in the Geo that hosts the Azure AI services used by skills, or the Azure apps or functions hosting custom skills, or the Azure OpenAI or Azure AI Foundry region that hosts your deployed models. These resources are specified by you, so you can choose whether to deploy them in the same Geo as your search service or not.

If you send data to external (non-Azure) models or services, the processing location is determined by the external service.

Question 33

Can I control access to search results based on user identity?

Accepted Answer

You can if you implement a solution that associates documents with a user identity. Typically, users who are authorized to run your application are also authorized to see all search results. Azure AI Search doesn't have built-in support for row-level or document-level permissions, but you can implement security filters as a workaround. For steps and script, see Get started with the Python enterprise chat sample using RAG.

Question 34

Can I control access to operations based on user identity?

Accepted Answer

Yes, you can use role-based authorization for data plane operations over content.

Question 35

Can I use the Azure portal to view and manage search content if the search service is behind an IP firewall or a private endpoint?

Accepted Answer

You can use the Azure portal on a network-protected search service if you create a network exception that allows client and portal access. For more information, see connect through an IP firewall or connect through a private endpoint.

Del via

Azure AI Search Frequently Asked Questions

General