Training
Certification
Microsoft Certified: Azure AI Engineer Associate - Certifications
Design and implement an Azure AI solution using Azure AI services, Azure AI Search, and Azure Open AI.
This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
Azure AI Search is an information retrieval platform for the enterprise. It supports traditional search and conversational AI-driven search for "chat with your data" experiences over your proprietary content.
The easiest way to create a service is using the Azure portal, which is covered in this article.
You can also use Azure PowerShell, Azure CLI, the Management REST API, an Azure Resource Manager service template, a Bicep file, or Terraform.
A few service properties are fixed for the lifetime of the service. Before creating the service, decide on a name, region, and tier.
Service name becomes part of the URL endpoint. The name must be unique and it must conform to naming rules.
Region determines data residency and the availability of certain features. Semantic ranker and Azure AI integration come with region requirements. Make sure your region of choice supports the features you need.
Service tier determines infrastructure, service limits, and billing. Some features aren't available on lower or specialized tiers.
Paid (or billable) search occurs when you choose a billable tier (Basic or higher) when creating the resource on a billable Azure subscription.
To try Azure AI Search for free, open a trial subscription and then create your search service by choosing the Free tier. You can have one free search service per Azure subscription. Free search services are intended for short-term evaluation of the product for nonproduction applications. Generally, you can complete all of the quickstarts and most tutorials, except for those featuring semantic ranker (it requires a billable service). Free services that are inactive for an extended period of time can be deleted by Microsoft to make room for other services.
Alternatively, you can use free credits to try out paid Azure services. With this approach, you can create your search service at Basic or higher to get more capacity. Your credit card is never charged unless you explicitly change your settings and ask to be charged. Another approach is to activate Azure credits in a Visual Studio subscription. A Visual Studio subscription gives you credits every month you can use for paid Azure services.
Sign in to the Azure portal.
Select (Create Resource") in the top-left corner.
Use the search bar to find "Azure AI Search*.
If you have more than one subscription, choose one for your search service. If you're implementing customer-managed encryption or if you use other features that depend on managed service identities for external data access, choose the same subscription as the one used for Azure Key Vault or other services for which managed identities are used.
A resource group is a container that holds related resources for your Azure solution. It's useful for consolidating same-solution resources, monitoring costs, and for checking the creation date of your search service.
Over time, you can track current and projected costs all-up or you can view charges for individual resources. The following screenshot shows the kind of cost information you can expect to see when you combine multiple resources into one group.
Tip
Resource groups simplify cleanup because deleting a resource group deletes everything within it.
In Instance Details, provide a service name in the URL field. The name is part of the endpoint against which API calls are issued: https://your-service-name.search.windows.net
. For example, if you want the endpoint to be https://myservice.search.windows.net
, you would enter myservice
.
Service name requirements:
-
)Tip
If you have multiple search services, it helps to include the region (or location) in the service name as a naming convention. A name like mysearchservice-westus
can save you a trip to the properties page when deciding how to combine or attach resources.
Important
Due to high demand, Azure AI Search is currently unavailable for new instances in some regions.
If you use multiple Azure services, putting all of them in the same region minimizes or voids bandwidth charges. There are no charges for data egress among same-region services.
Generally, choose a region near you, unless the following considerations apply:
Your nearest region is at capacity. One advantage to using the Azure portal for resource setup is that it provides only those regions and tiers that are available.
You want to use integrated data chunking and vectorization or built-in skills for AI enrichment. Integrated operations have region requirements.
You want to use Azure Storage for indexer-based indexing or you need to store application data that isn't in an index. Debug session state, enrichment caches, and knowledge stores are Azure AI Search features that have a dependency on Azure Storage. The region you choose for Azure Storage has implications for network security. Specifically, if you're setting up a firewall, you should place the resources in separate regions. For more information, see Outbound connections from Azure AI Search to Azure Storage.
Is Azure AI Search available in a nearby region? Check the supported regions list.
Do you have a specific tier in mind? Check region availability by tier.
Do you have business continuity and disaster recovery (BCDR) requirements? Create two or more search services in regional pairs within availability zones. For example, if you're operating in North America, you might choose East US and West US, or North Central US and South Central US, for each search service.
Do you need AI enrichment, integrated data chunking and vectorization, or multimodal image search? Azure AI Search, Azure OpenAI, and Azure AI multiservice must coexist in the same region.
Start with Azure OpenAI regions because it has the most variability. Azure OpenAI provides embedding models and chat models for RAG and integrated vectorization.
Check Azure AI Search regions for a match to your Azure OpenAI region. If you're using OCR, entity recognition, or other skills backed by Azure AI, the AI Integration column indicates whether Azure AI multiservice is in the same region as Azure AI Search.
Check multimodal embedding regions for multimodal APIs and image search. This API is accessed through an Azure AI multiservice account, but it's available in fewer regions than Azure AI multiservice in general.
Currently, the following regions offer cross-region among all three services (Azure AI Search, Azure OpenAI, Azure AI Vision multimodal). This list isn't definitive, and there might be more choices beyond the regions listed here depending on the tier. Also, region status can change quickly, so be sure to confirm region choice before installing.
Azure AI Search is offered in multiple pricing tiers: Free, Basic, Standard, or Storage Optimized. Each tier has its own capacity and limits. There are also several features that are tier-dependent.
Review the tier descriptions for computing characteristics, feature availability, and region availability.
Basic and Standard are the most common choices for production workloads, but many customers start with the Free service. Among the billable tiers, key differences are partition size and speed, and limits on the number of objects you can create.
Search services created after April 3, 2024 have larger partitions and higher vector quotas at every billable tier.
Remember, a pricing tier can't be changed once the service is created. If you need a higher or lower tier, you should re-create the service.
After you've provided the necessary inputs, go ahead and create the service.
Your service is deployed within minutes. You can monitor progress through Azure notifications. Consider pinning the service to your dashboard for easy access in the future.
Unless you're using the Azure portal, programmatic access to your new service requires that you provide the URL endpoint and an authenticated connection. You can use either or both of these options:
When setting up a programmatic connection, you need the search service endpoint. On the Overview page, locate and copy the URL endpoint on the right side of the page.
To set authentication options, use the Keys page. Most quickstarts and tutorials use API keys for simplicity, but if you're setting up a service for production workloads, consider using Azure roles. You can copy keys from this page.
An endpoint and key aren't needed for portal-based tasks. the Azure portal is already linked to your Azure AI Search resource with admin rights. For a portal walkthrough, start with Quickstart: Create an Azure AI Search index in the Azure portal.
After a search service is provisioned, you can scale it to meet your needs. On a billable tier, you can scale the service in two dimensions: replicas and partitions. For the free service, scale up isn't available and replica and partition configuration isn't offered.
Partitions allow your service to store and search through more documents.
Replicas allow your service to handle a higher load of search queries.
Adding resources increases your monthly bill. The pricing calculator can help you understand the billing ramifications of adding resources. Remember that you can adjust resources based on load. For example, you might increase resources to create a full initial index, and then reduce resources later to a level more appropriate for incremental indexing.
Important
A service must have 2 replicas for read-only SLA and 3 replicas for read/write SLA.
Most customers use just one service provisioned at a tier sufficient for expected load. One service can host multiple indexes, subject to the maximum limits of the tier you select, with each index isolated from another. In Azure AI Search, requests can only be directed to one index, minimizing the chance of accidental or intentional data retrieval from other indexes in the same service.
Although most customers use just one service, service redundancy might be necessary if operational requirements include the following:
Business continuity and disaster recovery (BCDR). Azure AI Search doesn't provide instant failover if there's an outage.
Multitenant architectures sometimes call for two or more services.
Globally deployed applications might require search services in each geography to minimize latency.
Note
In Azure AI Search, you cannot segregate indexing and querying operations; thus, you would never create multiple services for segregated workloads. An index is always queried on the service in which it was created (you cannot create an index in one service and copy it to another).
A second service isn't required for high availability. High availability for queries is achieved when you use two or more replicas in the same service. Replica updates are sequential, which means at least one is operational when a service update is rolled out. For more information about uptime, see Service Level Agreements.
Azure AI Search restricts the number of search services you can initially create in a subscription. If you exhaust your maximum limit, you can request more quota.
You must have Owner or Contributor permissions on the subscription to request quota. Depending on region and datacenter capacity, you can automatically request more quota to add services to your subscription. If the request fails, you should either decrease the number or file a support ticket. For a large increase in quota, such as more than 30 extra services, you should expect a one-month turnaround.
Sign in to the Azure portal, search for "quotas" in your dashboard, and then select the Quotas service.
In the Quota's Overview page, select Search.
Set filters so that you can review existing quota for search services in the current subscription. We recommend filtering by usage.
Find the region and tier that needs more quota and select the Edit pencil icon to begin your request.
In Quota details, specify the location, tier, and a new limit for your subscription quota. None of the values can be empty. The new limit must be greater than the current limit. If regional capacity is constrained, your request won't be automatically approved. In this scenario, an incident report is generated on your behalf for investigation and resolution.
Submit the request.
Monitor notifications in the Azure portal for status updates on the new limit. Most requests are approved within 24 hours.
After provisioning a service, you can continue in the Azure portal to create your first index.
Want to optimize and save on your cloud spending?
Training
Certification
Microsoft Certified: Azure AI Engineer Associate - Certifications
Design and implement an Azure AI solution using Azure AI services, Azure AI Search, and Azure Open AI.