What is a knowledge source?

Note

Azure AI Search is available through the Azure portal, REST APIs, and Azure SDKs. It also underpins Foundry IQ, the managed knowledge layer that transforms enterprise content into reusable, permission-aware knowledge bases for agents in the Microsoft Foundry portal.

Note

Some agentic retrieval features are generally available in the 2026-04-01 REST API via programmatic access. The Azure portal and Microsoft Foundry portal continue to provide preview-only access to all agentic retrieval features. For migration guidance, including a breakdown of what's generally available and what remains in preview, see Migrate agentic retrieval code to the latest version.

If you choose to use a preview REST API, you can access agentic retrieval capabilities that aren't yet generally available. Preview features are provided without a service-level agreement and aren't recommended for production workloads. For more information, see Supplemental Terms of Use for Microsoft Azure Previews.

Important

These features and functionality are part of the 2026-05-01-preview REST API. The 2026-05-01-preview is licensed to you as part of your Azure subscription and is subject to the terms applicable to "Previews" in the Microsoft Product Terms, the Microsoft Products and Services Data Protection Addendum ("DPA"), and the Supplemental Terms of Use for Microsoft Azure Previews.

The 2026-05-01-preview supports connections to other Microsoft services and third-party services. Use of these services is subject to their respective terms and might result in data processing or storage outside of the Azure compliance boundary, as well as data flowing into the Azure compliance boundary.

It's your responsibility to manage whether your data flows outside of your organization's compliance and geographic boundaries, to consider any related implications, and to ensure that appropriate permissions, boundaries, and approvals are provisioned.

You're responsible for carefully reviewing and testing applications you build in the context of your specific use cases and making all appropriate decisions and customizations. This includes implementing your own responsible AI mitigations, such as metaprompts, content filters, or other safety systems, and ensuring your applications meet appropriate quality, reliability, security, and trustworthiness standards. For more information, see the Azure AI Search Transparency Note.

A knowledge source is a top-level resource on your Azure AI Search service that defines the content used in an agentic retrieval pipeline. Each knowledge source is either indexed or remote, which determines how the content is ingested, processed, and queried. Knowledge sources are required components of a knowledge base.

You can reference multiple knowledge sources in a single knowledge base. The agentic retrieval engine queries all of them in a single request. Subqueries are generated for each knowledge source, and the top results are returned in the retrieval response.

Supported knowledge sources

Azure AI Search supports the following knowledge sources for agentic retrieval workloads.

Kind	Description	Indexed or remote
Search index	Wraps an existing index.	Indexed
Azure blob	Generates an indexer pipeline from a blob container.	Indexed
Azure SQL (preview)	Generates an indexer pipeline from an Azure SQL table or view.	Indexed
File (preview)	Uploads files directly to Azure AI Search.	Indexed
OneLake	Generates an indexer pipeline from a lakehouse.	Indexed
Indexed SharePoint (preview)	Generates an indexer pipeline from a SharePoint site.	Indexed
Remote SharePoint (preview)	Retrieves content from SharePoint.	Remote
Fabric Data Agent (preview)	Retrieves answers and embedded resources from a Microsoft Fabric data agent.	Remote
Fabric Ontology (preview)	Retrieves entity- and relationship-based answers from a Microsoft Fabric ontology.	Remote
MCP server (preview)	Retrieves live, tool-backed results from an external MCP server.	Remote
Work IQ (preview)	Retrieves organizational intelligence from Work IQ.	Remote
Web	Retrieves real-time grounding data from Microsoft Bing.	Remote

Indexed knowledge sources

An indexed knowledge source points to a search index that meets the criteria for agentic retrieval. Content is ingested into the index before query time through one of three paths:

Bring your own index: Use a search index knowledge source to wrap an existing index on your search service.
Direct file upload: Use a file knowledge source to upload files directly to Azure AI Search. The service processes the files and stores extracted content in a generated search index, with no external storage or indexer pipeline required.
Auto-generated indexer pipeline: For all other indexed knowledge sources, Azure AI Search automatically creates a complete indexer pipeline from your external data source. The pipeline consists of a data source, a skillset, an indexer, and an index that's populated with chunked content.

Queries run locally on your search service using keyword (full text), vector, or hybrid queries.

Remote knowledge sources

A remote knowledge source connects directly to an external platform. Content is never ingested into Azure AI Search. Instead, it's retrieved at query time via each platform's native APIs. The agentic retrieval engine makes the API call and surfaces results alongside any indexed knowledge sources in the same response.

Depending on the platform, remote connections reach content either over the public internet (such as Bing) or within your Microsoft tenant (such as SharePoint and Fabric).

Unified ranking

For both indexed and remote knowledge sources, all retrieved content flows through the same ranking pipeline. Results are scored for relevance, merged across queries, and reranked before they're returned in the retrieval response.
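
As a rough illustration of that flow, the following Python sketch merges hits from several subqueries and knowledge sources into one ranked list. It's a conceptual model only. The `Hit` type, the `merge_and_rerank` function, the sample data, and the term-overlap reranker are all hypothetical stand-ins and don't reflect how the service actually scores or reranks content.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    source: str   # knowledge source that produced the hit
    doc_id: str   # identifier of the document or chunk
    text: str     # retrieved content
    score: float  # relevance score from the source-level query


def toy_rerank_score(query: str, text: str) -> float:
    """Hypothetical reranker: fraction of query terms found in the text."""
    terms = set(query.lower().split())
    words = set(text.lower().split())
    return len(terms & words) / len(terms) if terms else 0.0


def merge_and_rerank(query: str, result_sets: list[list[Hit]], top: int = 3) -> list[Hit]:
    """Merge hits from all subqueries, drop duplicates, then rerank."""
    best: dict[tuple[str, str], Hit] = {}
    for hits in result_sets:
        for hit in hits:
            key = (hit.source, hit.doc_id)
            # Keep the highest-scoring copy when subqueries return the same item.
            if key not in best or hit.score > best[key].score:
                best[key] = hit
    reranked = sorted(
        best.values(),
        key=lambda h: (toy_rerank_score(query, h.text), h.score),
        reverse=True,
    )
    return reranked[:top]


# Sample data: one result set from an indexed source, one mixed set that
# includes a remote source and a duplicate of an indexed hit.
indexed_hits = [
    Hit("hr-index", "doc1", "vacation policy for full-time employees", 0.72),
    Hit("hr-index", "doc2", "expense reporting guide", 0.40),
]
remote_hits = [
    Hit("sharepoint", "page9", "updated vacation policy 2026", 0.65),
    Hit("hr-index", "doc1", "vacation policy for full-time employees", 0.80),
]

ranked = merge_and_rerank("vacation policy", [indexed_hits, remote_hits])
for hit in ranked:
    print(hit.source, hit.doc_id)
```

In this sketch, the duplicate `doc1` hit collapses to its higher-scoring copy, and indexed and remote hits compete in the same final ordering.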

Working with knowledge sources

Knowledge sources are independent objects that you create and manage separately from knowledge bases. Keep the following in mind:

Create a knowledge source before you create a knowledge base. Knowledge bases reference knowledge sources by ID, so the knowledge source must exist first.
To delete a knowledge source, first update or delete any knowledge bases that reference it. You can then delete the knowledge source.
A knowledge source and its knowledge base must exist on the same search service.
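
The ordering rules above can be sketched with a small in-memory model. This isn't the service API: `SearchServiceModel` and its methods are hypothetical, and the sketch assumes that an out-of-order operation is rejected with an error. Each instance stands for one search service, so a knowledge base can only reference knowledge sources on the same instance.

```python
class SearchServiceModel:
    """Toy in-memory model of one search service (not the real API)."""

    def __init__(self) -> None:
        self.knowledge_sources: set[str] = set()
        self.knowledge_bases: dict[str, set[str]] = {}

    def create_knowledge_source(self, name: str) -> None:
        self.knowledge_sources.add(name)

    def create_knowledge_base(self, name: str, source_names: list[str]) -> None:
        # Every referenced knowledge source must already exist on this service.
        missing = [s for s in source_names if s not in self.knowledge_sources]
        if missing:
            raise ValueError(f"create these knowledge sources first: {missing}")
        self.knowledge_bases[name] = set(source_names)

    def delete_knowledge_base(self, name: str) -> None:
        self.knowledge_bases.pop(name, None)

    def delete_knowledge_source(self, name: str) -> None:
        # A source can't be deleted while any knowledge base still references it.
        in_use = sorted(kb for kb, refs in self.knowledge_bases.items() if name in refs)
        if in_use:
            raise ValueError(f"still referenced by knowledge bases: {in_use}")
        self.knowledge_sources.discard(name)


service = SearchServiceModel()
service.create_knowledge_source("docs-blob")
service.create_knowledge_base("support-kb", ["docs-blob"])

try:
    service.delete_knowledge_source("docs-blob")
except ValueError as err:
    print(err)  # the knowledge base still references the source

service.delete_knowledge_base("support-kb")
service.delete_knowledge_source("docs-blob")
```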

Creating knowledge sources

To create a knowledge source, you need Search Service Contributor permissions on your search service. If the knowledge source generates an indexer pipeline, you also need Search Index Data Contributor permissions to load an index. You can use an admin API key as an alternative to role assignments.
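
The role requirements can be summarized as a small lookup. The following is a minimal sketch. The kind strings are hypothetical labels for rows in the Supported knowledge sources table and might not match the kind values the API expects.

```python
# Hypothetical labels for the kinds that generate an indexer pipeline,
# taken from the "Supported knowledge sources" table. They are not
# guaranteed to match the API's kind values.
PIPELINE_KINDS = frozenset({"azureBlob", "azureSql", "oneLake", "indexedSharePoint"})


def required_roles(kind: str) -> list[str]:
    """List the roles needed to create a knowledge source of the given kind."""
    roles = ["Search Service Contributor"]
    if kind in PIPELINE_KINDS:
        # Pipeline-generating kinds also load an index.
        roles.append("Search Index Data Contributor")
    return roles


print(required_roles("searchIndex"))
print(required_roles("azureBlob"))
```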

Creation support in the Azure portal, Microsoft Foundry portal, REST API, and Azure SDKs varies by knowledge source kind. For per-kind instructions, see the links in Supported knowledge sources.

Ingest sensitivity labels (preview)

For blob, indexed OneLake, and indexed SharePoint knowledge sources, you can ingest Microsoft Purview sensitivity labels by setting ingestionPermissionOptions to include sensitivityLabel. Follow all the prerequisites before you set this value. After they're synchronized to the index, labels are surfaced in retrieve responses and used to enforce document-level access at query time. For more information, see Enforce permissions at query time (preview).

If your indexed knowledge source uses a chunked index, such as with integrated vectorization or a custom Text Split skill, you must also map the sensitivity label to each chunk row via index projections in the skillset. Otherwise, retrieve responses omit chunk-level references for source documents that have a label.
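
The following Python sketch shows one way to add `sensitivityLabel` to a knowledge source definition before you send it to the service. Only `ingestionPermissionOptions`, `sensitivityLabel`, and `ingestionParameters` come from this article. The placement of `ingestionPermissionOptions` inside `ingestionParameters`, the `azureBlobParameters` and `containerName` property names, the `kind` value, and the sample names are assumptions for illustration, so check the REST API reference for the exact schema.

```python
import copy
import json


def with_sensitivity_labels(definition: dict, parameters_key: str) -> dict:
    """Return a copy of a knowledge source definition that ingests sensitivity labels."""
    updated = copy.deepcopy(definition)
    # Assumed nesting: <kind parameters> -> ingestionParameters -> ingestionPermissionOptions
    ingestion = updated.setdefault(parameters_key, {}).setdefault("ingestionParameters", {})
    options = ingestion.setdefault("ingestionPermissionOptions", [])
    if "sensitivityLabel" not in options:
        options.append("sensitivityLabel")
    return updated


blob_source = {
    "name": "contracts-blob",          # hypothetical name
    "kind": "azureBlob",               # assumed kind value
    "azureBlobParameters": {           # assumed property name
        "containerName": "contracts",  # hypothetical container
    },
}

labeled_source = with_sensitivity_labels(blob_source, "azureBlobParameters")
print(json.dumps(labeled_source, indent=2))
```

The helper returns a copy so that the original definition stays unchanged. It also preserves any permission options that are already set.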

Surface document-embedded images (preview)

For blob, indexed OneLake, and indexed SharePoint knowledge sources, you can configure an assetStore in the knowledge source's ingestionParameters to persist images that are embedded in your source documents. When you also enable image serving on the knowledge base, the retrieve action injects those images into the answer synthesis prompt so the LLM can reason over diagrams, charts, and extracted image content. For more information, see Surface document-embedded images in agentic retrieval (preview).

Using knowledge sources

After you create a knowledge source, reference it in a knowledge base. The knowledge base determines which knowledge sources to query. The following sections describe options for controlling which sources are included and how the engine selects among them.

Always query a knowledge source

Set alwaysQuery to true on a knowledge source definition to include it in every query, regardless of the retrieval reasoning effort.

Use the retrieval reasoning effort to control LLM usage (preview)

The retrieval reasoning effort controls how much LLM processing is applied to each query. Not all solutions benefit from LLM query planning. If simplicity and speed are the priority, use minimal effort to bypass LLM processing. At low and medium effort, the LLM plans and selects which knowledge sources to query, with medium adding an iterative pass for deeper results. For more information about each level, see Set the retrieval reasoning effort (preview).

The following factors inform selection at low and medium effort:

The name of the knowledge source.
The description of an index (for indexed knowledge sources).
The retrievalInstructions specified in the knowledge base definition or the retrieve action. Retrieval instructions guide which knowledge sources the LLM selects or skips. They work like a prompt: you can specify brevity, tone, and formatting.
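
To make the interaction between `alwaysQuery` and the retrieval reasoning effort concrete, here's a minimal Python sketch of source selection. It's a model of the behavior described above, not the engine's implementation. `select_sources` and `keyword_planner` are hypothetical, the keyword matcher is a simple stand-in for the LLM, and the sketch assumes that minimal effort queries every referenced knowledge source because no planning step runs.

```python
from typing import Callable, Optional

# A planner takes the query and the source definitions and returns selected names.
Planner = Callable[[str, list[dict]], set[str]]


def select_sources(query: str, sources: list[dict], effort: str,
                   planner: Optional[Planner] = None) -> list[str]:
    """Return the names of the knowledge sources to query, in definition order."""
    always = {s["name"] for s in sources if s.get("alwaysQuery")}
    if effort == "minimal" or planner is None:
        # Assumption: with no LLM planning, every referenced source is queried.
        return [s["name"] for s in sources]
    # At low and medium effort, the planner picks sources; alwaysQuery sources are added.
    chosen = planner(query, sources) | always
    return [s["name"] for s in sources if s["name"] in chosen]


def keyword_planner(query: str, sources: list[dict]) -> set[str]:
    """Hypothetical stand-in for the LLM: match query words against name and description."""
    words = set(query.lower().split())
    picked = set()
    for s in sources:
        haystack = f'{s["name"]} {s.get("description", "")}'.lower().replace("-", " ")
        if words & set(haystack.split()):
            picked.add(s["name"])
    return picked


sources = [
    {"name": "hr-policies", "description": "vacation and benefits policies"},
    {"name": "eng-wiki", "description": "engineering design docs"},
    {"name": "web", "description": "public web results", "alwaysQuery": True},
]

low_selection = select_sources("vacation days", sources, "low", keyword_planner)
minimal_selection = select_sources("vacation days", sources, "minimal")
print(low_selection)
print(minimal_selection)
```

In this sketch, the `web` source is queried at low effort even though the planner doesn't pick it, because its `alwaysQuery` flag is set.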

Last updated on 2026-06-12