Azure OpenAI On Your Data

Note

This document refers to the Microsoft Foundry (classic) portal.

🔍 View the Microsoft Foundry (new) documentation to learn about the new portal.

Use this article to learn about Azure OpenAI On Your Data, which makes it easier for developers to connect, ingest and ground their enterprise data to create personalized copilots (preview) rapidly. It enhances user comprehension, expedites task completion, improves operational efficiency, and aids decision-making.

What is Azure OpenAI On Your Data

Azure OpenAI On Your Data enables you to run advanced AI models such as GPT-35-Turbo and GPT-4 on your own enterprise data without needing to train or fine-tune models. You can chat on top of and analyze your data with greater accuracy. You can specify sources to support the responses based on the latest information available in your designated data sources. You can access Azure OpenAI On Your Data using a REST API, via the SDK or the web-based interface in the Microsoft Foundry portal. You can also create a web app that connects to your data to enable an enhanced chat solution or deploy it directly as a copilot in the Copilot Studio (preview).

Developing with Azure OpenAI On Your Data

A diagram showing an example workflow.

Typically, the development process you'd use with Azure OpenAI On Your Data is:

Ingest: Upload files using either Foundry portal or the ingestion API. This enables your data to be cracked, chunked and embedded into an Azure AI Search instance that can be used by Azure OpenAI models. If you have an existing supported data source, you can also connect it directly.
Develop: After trying Azure OpenAI On Your Data, begin developing your application using the available REST API and SDKs, which are available in several languages. It will create prompts and search intents to pass to the Azure OpenAI service.
Inference: After your application is deployed in your preferred environment, it will send prompts to Azure OpenAI, which will perform several steps before returning a response:
1. Intent generation: The service will determine the intent of the user's prompt to determine a proper response.
2. Retrieval: The service retrieves relevant chunks of available data from the connected data source by querying it. For example by using a semantic or vector search. Parameters such as strictness and number of documents to retrieve are utilized to influence the retrieval.
3. Filtration and reranking: Search results from the retrieval step are improved by ranking and filtering data to refine relevance.
4. Response generation: The resulting data is submitted along with other information like the system message to the Large Language Model (LLM) and the response is sent back to the application.

To get started, connect your data source using Foundry portal and start asking questions and chatting on your data.

Azure Role-based access controls (Azure RBAC) for adding data sources

To use Azure OpenAI On Your Data fully, you need to set one or more Azure RBAC roles. See Azure OpenAI On Your Data configuration for more information.

Data formats and file types

Azure OpenAI On Your Data supports the following file types:

.txt
.md
.html
.docx
.pptx
.pdf

There's an upload limit, and there are some caveats about document structure and how it might affect the quality of responses from the model:

If you're converting data from an unsupported format into a supported format, optimize the quality of the model response by ensuring the conversion:
- Doesn't lead to significant data loss.
- Doesn't add unexpected noise to your data.
If your files have special formatting, such as tables and columns, or bullet points, prepare your data with the data preparation script available on GitHub.
For documents and datasets with long text, you should use the available data preparation script. The script chunks data so that the model's responses are more accurate. This script also supports scanned PDF files and images.

Supported data sources

You need to connect to a data source to upload your data. When you want to use your data to chat with an Azure OpenAI model, your data is chunked in a search index so that relevant data can be found based on user queries.

Note

Your data should be unstructured text for best results. If you have non-textual semi-structured or structured data consider converting it to text. If your files have special formatting, such as tables and columns, or bullet points, prepare your data with the data preparation script available on GitHub.

The Integrated Vector Database in vCore-based Azure Cosmos DB for MongoDB natively supports integration with Azure OpenAI On Your Data.

For some data sources such as uploading files from your local machine (preview) or data contained in a blob storage account (preview), Azure AI Search is used. When you choose the following data sources, your data is ingested into an Azure AI Search index.

Data ingested through Azure AI Search	Description
Azure AI Search	Use an existing Azure AI Search index with Azure OpenAI On Your Data.
Upload files (preview)	Upload files from your local machine to be stored in an Azure Blob Storage database, and ingested into Azure AI Search.
URL/Web address (preview)	Web content from the URLs is stored in Azure Blob Storage.
Azure Blob Storage (preview)	Upload files from Azure Blob Storage to be ingested into an Azure AI Search index.

You might want to consider using an Azure AI Search index when you either want to:

Customize the index creation process.
Reuse an index created before by ingesting data from other data sources.

Note

To use an existing index, it must have at least one searchable field.
Set the CORS Allow Origin Type option to all and the Allowed origins option to *.
You cannot have complex fields in your search index.

Search types

Azure OpenAI On Your Data provides the following search types you can use when you add your data source.

Keyword search
Semantic search
Vector search using the text-embedding-ada-002 embedding model, available in selected regions

To enable vector search, you need an existing embedding model deployed in your Azure OpenAI resource. Select your embedding deployment when connecting your data, then select one of the vector search types under Data management. If you're using Azure AI Search as a data source, make sure you have a vector column in the index.

If you're using your own index, you can customize the field mapping when you add your data source to define the fields that will get mapped when answering questions. To customize field mapping, select Use custom field mapping on the Data Source page when adding your data source.

Important

Semantic search is subject to additional pricing. You need to choose Basic or higher SKU to enable semantic search or vector search. See pricing tier difference and service limits for more information.
To help improve the quality of the information retrieval and model response, we recommend enabling semantic search for the following data source languages: English, French, Spanish, Portuguese, Italian, Germany, Chinese(Zh), Japanese, Korean, Russian, and Arabic.

Search option	Retrieval type	Additional pricing?	Benefits
keyword	Keyword search	No additional pricing.	Performs fast and flexible query parsing and matching over searchable fields, using terms or phrases in any supported language, with or without operators.
semantic	Semantic search	Additional pricing for semantic search usage.	Improves the precision and relevance of search results by using a reranker (with AI models) to understand the semantic meaning of query terms and documents returned by the initial search ranker
vector	Vector search	Additional pricing on your Azure OpenAI account from calling the embedding model.	Enables you to find documents that are similar to a given query input based on the vector embeddings of the content.
hybrid (vector + keyword)	A hybrid of vector search and keyword search	Additional pricing on your Azure OpenAI account from calling the embedding model.	Performs similarity search over vector fields using vector embeddings, while also supporting flexible query parsing and full text search over alphanumeric fields using term queries.
hybrid (vector + keyword) + semantic	A hybrid of vector search, semantic search, and keyword search.	Additional pricing on your Azure OpenAI account from calling the embedding model, and additional pricing for semantic search usage.	Uses vector embeddings, language understanding, and flexible query parsing to create rich search experiences and generative AI apps that can handle complex and diverse information retrieval scenarios.

Intelligent search

Azure OpenAI On Your Data has intelligent search enabled for your data. Semantic search is enabled by default if you have both semantic search and keyword search. If you have embedding models, intelligent search defaults to hybrid + semantic search.

Document-level access control

Note

Document-level access control is supported when you select Azure AI Search as your data source.

Azure OpenAI On Your Data lets you restrict the documents that can be used in responses for different users with Azure AI Search security filters. When you enable document level access, the search results returned from Azure AI Search and used to generate a response are trimmed based on user Microsoft Entra group membership. You can only enable document-level access on existing Azure AI Search indexes. See Azure OpenAI On Your Data network and access configuration for more information.

Index field mapping

If you're using your own index, you'll be prompted in the Foundry portal to define which fields you want to map for answering questions when you add your data source. You can provide multiple fields for Content data, and should include all fields that have text pertaining to your use case.

In this example, the fields mapped to Content data and Title provide information to the model to answer questions. Title is also used to title citation text. The field mapped to File name generates the citation names in the response.

Mapping these fields correctly helps ensure the model has better response and citation quality. You can additionally configure it in the API using the fieldsMapping parameter.

If you want to implement additional value-based criteria for query execution, you can set up a search filter using the filter parameter in the REST API.

How data is ingested into Azure AI search

As of September 2024, the ingestion APIs switched to integrated vectorization. This update does not alter the existing API contracts. Integrated vectorization, a new offering of Azure AI Search, utilizes prebuilt skills for chunking and embedding the input data. The Azure OpenAI On Your Data ingestion service no longer employs custom skills. Following the migration to integrated vectorization, the ingestion process has undergone some modifications and as a result only the following assets are created:

{job-id}-index
{job-id}-indexer, if an hourly or daily schedule is specified, otherwise, the indexer is cleaned-up at the end of the ingestion process.
{job-id}-datasource

The chunks container is no longer available, as this functionality is now inherently managed by Azure AI Search.

Data connection

You need to select how you want to authenticate the connection from Azure OpenAI, Azure AI Search, and Azure blob storage. You can choose a System assigned managed identity or an API key. By selecting API key as the authentication type, the system will automatically populate the API key for you to connect with your Azure AI Search, Azure OpenAI, and Azure Blob Storage resources. By selecting System assigned managed identity, the authentication will be based on the role assignment you have. System assigned managed identity is selected by default for security.

Once you select the next button, it will automatically validate your setup to use the selected authentication method. If you encounter an error, see the role assignments article to update your setup.

Once you have fixed the setup, select next again to validate and proceed. API users can also configure authentication with assigned managed identity and API keys.

You might want to use Azure Blob Storage as a data source if you want to connect to existing Azure Blob Storage and use files stored in your containers.

Schedule automatic index refreshes

Note

Automatic index refreshing is supported for Azure Blob Storage only.

To keep your Azure AI Search index up-to-date with your latest data, you can schedule an automatic index refresh rather than manually updating it every time your data is updated. Automatic index refresh is only available when you choose Azure Blob Storage as the data source. To enable an automatic index refresh:

Add a data source using Foundry portal.
Under Select or add data source select Indexer schedule and choose the refresh cadence you would like to apply.

After the data ingestion is set to a cadence other than once, Azure AI Search indexers will be created with a schedule equivalent to 0.5 * the cadence specified. This means that at the specified cadence, the indexers will pull, reprocess, and index the documents that were added or modified from the storage container. This process ensures that the updated data gets preprocessed and indexed in the final index at the desired cadence automatically. To update your data, you only need to upload the additional documents from the Azure portal. From the portal, select Storage Account > Containers. Select the name of the original container, then Upload. The index will pick up the files automatically after the scheduled refresh period. The intermediate assets created in the Azure AI Search resource won't be cleaned up after ingestion to allow for future runs. These assets are:

{Index Name}-index
{Index Name}-indexer
{Index Name}-datasource
{Index Name}-skillset

To modify the schedule, you can use the Azure portal.

Open your search resource page in the Azure portal
Select Indexers from the left pane
Perform the following steps on the two indexers that have your index name as a prefix.
1. Select the indexer to open it. Then select the settings tab.
2. Update the schedule to the desired cadence from "Schedule" or specify a custom cadence from "Interval (minutes)"
3. Select Save.

How data is ingested into Azure AI search

{job-id}-index
{job-id}-indexer, if an hourly or daily schedule is specified, otherwise, the indexer is cleaned-up at the end of the ingestion process.
{job-id}-datasource

The chunks container is no longer available, as this functionality is now inherently managed by Azure AI Search.

Data connection

Once you have fixed the setup, select next again to validate and proceed. API users can also configure authentication with assigned managed identity and API keys.

Using Foundry portal, you can upload files from your machine to try Azure OpenAI On Your Data. You also have the option to create a new Azure Blob Storage account and Azure AI Search resource. The service then stores the files to an Azure storage container and performs ingestion from the container. You can use the quickstart article to learn how to use this data source option.

How data is ingested into Azure AI search

{job-id}-index
{job-id}-indexer, if an hourly or daily schedule is specified, otherwise, the indexer is cleaned-up at the end of the ingestion process.
{job-id}-datasource

The chunks container is no longer available, as this functionality is now inherently managed by Azure AI Search.

Data connection

Once you have fixed the setup, select next again to validate and proceed. API users can also configure authentication with assigned managed identity and API keys.

You can paste URLs and the service will store the webpage content, using it when generating responses from the model. The content in URLs/web addresses that you use need to have the following characteristics to be properly ingested:

A public website, such as Using your data with Azure OpenAI in Foundry Models - Azure OpenAI | Microsoft Learn. You can't add a URL/Web address with access control, such as ones with a password.
An HTTPS website.
The size of content in each URL is smaller than 5 MB.
The website can be downloaded as one of the supported file types.
Only one layer of nested links is supported. Only up to 20 links, on the web page will be fetched.

Once you have added the URL/web address for data ingestion, the web pages from your URL are fetched and saved to Azure Blob Storage with a container name: webpage-<index name>. Each URL will be saved into a different container within the account. Then the files are indexed into an Azure AI Search index, which is used for retrieval when you’re chatting with the model.

How data is ingested into Azure AI search

{job-id}-index
{job-id}-indexer, if an hourly or daily schedule is specified, otherwise, the indexer is cleaned-up at the end of the ingestion process.
{job-id}-datasource

The chunks container is no longer available, as this functionality is now inherently managed by Azure AI Search.

Data connection

Once you have fixed the setup, select next again to validate and proceed. API users can also configure authentication with assigned managed identity and API keys.

You can connect to your Elasticsearch vector database and chat with your data.

Prerequisites

An Elasticsearch database
An embedding model. You can:
- Use an existing Azure OpenAI text-embedding-ada-002 embedding model, or
- Bring your own embedding model hosted on Elasticsearch.
Prepare your data using the python notebook available on GitHub.

Request access

Using the Elasticsearch data source is a preview feature which is subject to the Limited Access Service terms in the service-specific terms. You must fill out and submit a request form to request access to the Elasticsearch data source. The form requests information about your company and the scenario for which you plan to use the Elasticsearch data source. After you submit the form, the Azure OpenAI team will review it and email you with a decision within 10 business days.