Copilot Studio Knowledge indefinitely processing

Question

Himanshu Sharma 0

Copilot Studio Knowledge indefinitely processing

Ashlesha Pawar (Tata Consultancy Services Limited) 320 Reputation points Microsoft External Staff

2026-06-19T09:47:08.85+00:00
Could you please confirm if the AI response actually resolves your query, or are you still observing the issue where the knowledge source remains stuck in “In Progress”?

It is correct that knowledge ingestion is asynchronous and can take time due to indexing (chunking + embeddings).

However, it is not accurate to say that “In Progress” can persist indefinitely under normal operation.

If the status is stuck for a long period, it typically indicates a stalled or failed ingestion process, not an expected system delay.

Likely causes to validate:

- Connector/authentication issues (SharePoint / website / file source)
- Permission or access failures during ingestion
- Dataverse/vector indexing job stuck or retry failure
- Environment-level service or indexing backlog issue

1 answer

Answer 1

Copilot Studio knowledge sources are indexed asynchronously and can remain in an “In progress” state for an extended period while content is being crawled, chunked, and vector-indexed in Dataverse. This behavior is expected, especially for larger or changing data sets, because:

- Files and external content are ingested, chunked, and converted into semantic indexes and vector embeddings before they can be used for grounding responses.
- SharePoint, OneDrive, and other unstructured sources are not indexed in real time; there can be a noticeable delay between upload/changes and when the agent can fully use them.
- For SharePoint-based knowledge, content is cached; removing or changing files does not immediately update what the agent “sees.” There is no way to force immediate re-indexing or purge the cache per session.

If knowledge items appear stuck “In progress,” the practical implications are:

- The agent might use only a partial extract of the data until indexing completes. For unpublished agents, only a limited subset of the data may be processed.
- Copilot Studio knowledge ingestion is not suitable for dynamic or per-session content, because indexing and cache invalidation are not instantaneous.

Recommended actions based on the documented behavior:

Allow time for indexing
- For websites and other unstructured data, Copilot Studio refreshes and re-indexes content periodically (for websites, typically within 24 hours). A persistent “In progress” state shortly after adding or updating a source can be normal during this window.

Publish the agent after configuring knowledge
- Ensure the agent is published after adding or updating knowledge sources. Unpublished agents may only process a small extract of the data, which can look like incomplete or stuck processing.

Avoid per-session or rapidly changing files as knowledge
- Knowledge ingestion is optimized for static or semi-static content. For scenarios where files must be uploaded and removed per session, and answers must only use that session’s files, Copilot Studio knowledge sources are not appropriate because:
  - Indexing is asynchronous and not real time.
  - Cached content from previous files can still be used even after deletion.

Use alternative patterns for dynamic data
- For highly dynamic or per-session data, consider:
  - Using real-time connectors or tools (for example, Power Platform connectors) to fetch data at query time instead of indexing it as knowledge.
  - Designing the solution so that knowledge sources are relatively stable, and dynamic aspects are handled via tools/actions rather than uploaded knowledge.
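
To illustrate the "fetch at query time" pattern described above, here is a minimal Python sketch. It is not Copilot Studio code and does not call any Copilot Studio or Power Platform API; `SessionFileStore` and its methods are hypothetical names standing in for whatever connector or tool backs the agent. The point is only that per-session data is looked up live, so a removed file stops being visible immediately instead of lingering in an index or cache.

```python
from dataclasses import dataclass, field


@dataclass
class SessionFileStore:
    """Hypothetical in-memory stand-in for a real-time connector or tool."""

    # Maps session ID -> {file name -> file text}. Nothing is pre-indexed.
    files: dict[str, dict[str, str]] = field(default_factory=dict)

    def upload(self, session_id: str, name: str, text: str) -> None:
        """Store a file for one session only."""
        self.files.setdefault(session_id, {})[name] = text

    def remove(self, session_id: str, name: str) -> None:
        """Delete a file; later fetches no longer see it."""
        self.files.get(session_id, {}).pop(name, None)

    def fetch(self, session_id: str, query: str) -> list[str]:
        """Return names of this session's files whose text mentions the query."""
        needle = query.lower()
        session_files = self.files.get(session_id, {})
        return [name for name, text in session_files.items() if needle in text.lower()]
```

Because `fetch` reads the current state on every call, there is no indexing delay and no stale cache to invalidate, and data from one session is never returned to another. A real implementation would replace the substring match with whatever lookup the connector supports.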

If the “In progress” status persists well beyond expected indexing windows (for example, more than a day for static content) and the agent remains unable to use the knowledge, raise a support ticket with full details (tenant, environment, type and size of knowledge source, and timestamps) so the service team can investigate backend indexing issues.
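
As a rough aid for the escalation step above, the following Python sketch checks whether a source has been “In progress” longer than the expected window and assembles the details the answer suggests including in a ticket. The 24-hour threshold is an assumption taken from the "more than a day" guidance above, and the function names and field labels are hypothetical; the timestamps must be recorded manually, since this sketch does not read any status from Copilot Studio.

```python
from datetime import datetime, timedelta, timezone

# Assumption: "more than a day" from the answer above, not an official limit.
EXPECTED_WINDOW = timedelta(hours=24)


def is_stalled(added_at: datetime, now: datetime,
               window: timedelta = EXPECTED_WINDOW) -> bool:
    """True if the source has been processing for longer than the window."""
    return now - added_at > window


def ticket_details(tenant: str, environment: str, source_type: str,
                   size_mb: float, added_at: datetime, now: datetime) -> str:
    """Build a plain-text summary to paste into a support ticket."""
    hours = (now - added_at).total_seconds() / 3600
    return "\n".join([
        f"Tenant: {tenant}",
        f"Environment: {environment}",
        f"Knowledge source type: {source_type}",
        f"Knowledge source size: {size_mb} MB",
        f"Added at: {added_at.isoformat()}",
        f"Checked at: {now.isoformat()}",
        f"Time in 'In progress': {hours:.1f} h",
    ])
```

A typical use would be to call `is_stalled` with the time the source was added and `datetime.now(timezone.utc)`, and only open a ticket with the `ticket_details` output when it returns `True`.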
