
Copilot Studio Knowledge indefinitely processing

Himanshu Sharma 0 Reputation points
2026-06-18T10:21:53.93+00:00

Copilot Studio Knowledge indefinitely processing

Microsoft Copilot | Windows Copilot | App

1 answer

  1. AI answer

    2026-06-18T10:22:10.9733333+00:00

    Copilot Studio knowledge sources are indexed asynchronously and can remain in an “In progress” state for an extended period while content is being crawled, chunked, and vector-indexed in Dataverse. This behavior is expected, especially for larger or changing data sets, because:

    • Files and external content are ingested, chunked, and converted into semantic indexes and vector embeddings before they can be used for grounding responses.
    • SharePoint, OneDrive, and other unstructured sources are not indexed in real time; there can be noticeable delay between upload/changes and when the agent can fully use them.
    • For SharePoint-based knowledge, content is cached; removing or changing files does not immediately update what the agent “sees.” There is no way to force immediate re-indexing or purge the cache per session.
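
To make the "chunked" step above concrete, here is a minimal sketch of overlapping text chunking. It is an illustration only and not Copilot Studio's actual pipeline: the function name `chunk_text`, the character-based windows, and the default sizes are all assumptions chosen for the example.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character windows that overlap.

    Hypothetical helper for illustration; real ingestion pipelines usually
    split on tokens or document structure rather than raw characters.
    """
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far each new window advances
    chunks: list[str] = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once a window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks
```

Every chunk then has to be embedded and written to an index, which is one reason a large source can stay "In progress" for a while.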

    If knowledge items appear stuck “In progress,” the practical implications are:

    1. Until indexing completes, the agent may use only a partial extract of the data. For unpublished agents, only a limited subset of the data may be processed.
    2. For dynamic or per-session content, Copilot Studio knowledge ingestion is not suitable because indexing and cache invalidation are not instantaneous.

    Recommended actions based on the documented behavior:

    1. Allow time for indexing
      • For websites and other unstructured data, Copilot Studio refreshes and re-indexes content periodically (for websites, typically within 24 hours). An “In progress” state that lasts through this window after adding or updating a source can be normal.
    2. Publish the agent after configuring knowledge
      • Ensure the agent is published after adding or updating knowledge sources. Unpublished agents may only process a small extract of the data, which can look like incomplete or stuck processing.
    3. Avoid per-session or rapidly changing files as knowledge
      • Knowledge ingestion is optimized for static or semi-static content. For scenarios where files must be uploaded and removed per session, and answers must only use that session’s files, Copilot Studio knowledge sources are not appropriate because:
        • Indexing is asynchronous and not real time.
        • Cached content from previous files can still be used even after deletion.
    4. Use alternative patterns for dynamic data
      • For highly dynamic or per-session data, consider:
        • Using real-time connectors or tools (for example, Power Platform connectors) to fetch data at query time instead of indexing it as knowledge.
        • Designing the solution so that knowledge sources are relatively stable, and dynamic aspects are handled via tools/actions rather than uploaded knowledge.
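
The difference between indexed knowledge and a query-time fetch can be sketched with a toy model. Everything here is hypothetical: `SnapshotIndex` stands in for an asynchronously refreshed index, `fetch_live` stands in for a connector or tool call, and a plain dictionary plays the role of the file store. None of these names are Copilot Studio APIs.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class SnapshotIndex:
    """Hypothetical stand-in for an asynchronously refreshed knowledge index."""

    _snapshot: dict = field(default_factory=dict)

    def refresh(self, source: dict) -> None:
        # Copy the source as it exists right now; later changes stay
        # invisible until the next refresh runs.
        self._snapshot = dict(source)

    def lookup(self, name: str) -> Optional[str]:
        return self._snapshot.get(name)


def fetch_live(source: dict, name: str) -> Optional[str]:
    """Hypothetical tool call: read the source at query time."""
    return source.get(name)


files = {"session-a.txt": "notes for session A"}
index = SnapshotIndex()
index.refresh(files)

# The session ends and its file is replaced, but no refresh has run yet.
del files["session-a.txt"]
files["session-b.txt"] = "notes for session B"

stale = index.lookup("session-a.txt")      # deleted file is still served
missing = index.lookup("session-b.txt")    # new file is not indexed yet
live = fetch_live(files, "session-b.txt")  # query-time fetch sees it at once
```

In this model the snapshot keeps answering from the deleted file and cannot see the new one, which mirrors the per-session problem described above, while the query-time fetch always reflects the current state.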

    If the “In progress” status persists well beyond expected indexing windows (for example, more than a day for static content) and the agent remains unable to use the knowledge, raise a support ticket with full details (tenant, environment, type and size of knowledge source, and timestamps) so the service team can investigate backend indexing issues.
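
As a rough aid for deciding when to escalate, the check above can be written down as a small helper. This is a sketch under assumptions: the 24-hour window is taken from the "more than a day" guidance in this answer, the status string is the label shown in the UI, and the function names and ticket fields are hypothetical rather than part of any Microsoft API.

```python
from datetime import datetime, timedelta, timezone

# Assumed threshold, based on the "more than a day" guidance above.
INDEXING_WINDOW = timedelta(hours=24)


def should_escalate(added_at: datetime, now: datetime, status: str,
                    window: timedelta = INDEXING_WINDOW) -> bool:
    """Return True if a source is still "In progress" past the window."""
    return status == "In progress" and (now - added_at) > window


def ticket_details(tenant: str, environment: str, source_type: str,
                   size_mb: float, added_at: datetime, now: datetime) -> dict:
    """Collect the details the answer suggests including in a support ticket."""
    return {
        "tenant": tenant,
        "environment": environment,
        "knowledge_source_type": source_type,
        "knowledge_source_size_mb": size_mb,
        "added_at_utc": added_at.isoformat(),
        "observed_at_utc": now.isoformat(),
    }


# Placeholder values only; substitute the real tenant and environment.
added = datetime(2026, 6, 18, 10, 0, tzinfo=timezone.utc)
checked = added + timedelta(hours=30)

if should_escalate(added, checked, "In progress"):
    details = ticket_details("<tenant-id>", "<environment-name>",
                             "uploaded files", 12.5, added, checked)
```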



