Copilot agent can't read PDF files

Intranet AI 40 Reputation points
2025-06-11T08:28:16.2733333+00:00

I'm creating a KB agent with my client and they have added a knoweldge base to the agent full of PDF files. They can get answers on the contents from the normal Copilot Chat, but the agent isn't capable to give information about the contents and only guesses from the title.

Q: Should we just change PDF type? Is there some limit to the agents?

Key info:

  • The agent is being created with Agent Builder.
  • The KB is a folder in a SharePoint site.
  • The PDFs are the image type (so you can't select the text). However, feeding these PDFs to the normal Copilot Chat lets them get answers about the contents. Inside the agent, this doesn't work.

I haven't found people in a similar situation so any help is welcome. Just please stay on theme and don't send me elsewhere without having someone ready to answer over there. I'd like for this question to get a response instead of gathering dust, if possible.

Microsoft 365 and Office | SharePoint | For business | Windows
0 comments No comments
{count} vote

Accepted answer
  1. Jack-Bu 2,220 Reputation points Microsoft External Staff Moderator
    2025-06-11T11:50:02.49+00:00

    Hi Intranet AI

    Good day! Thanks for reaching out to the Microsoft Q&A forum.  

    To confirm, you're building an agent using Copilot Studio Agent Builder, and you've added a knowledge base consisting of PDF files stored in a SharePoint folder. While Copilot Chat is able to extract and answer questions based on the content of these PDFs, the agent you've built is only able to reference the file titles and cannot access the actual content, particularly when the PDFs are image-based (i.e., scanned documents without selectable text). 

    After reviewing the official Microsoft documentation on Use the Copilot Studio Agent Builder to Build Agents | Microsoft Learn, here are the key findings: 

    • File Type Support 
      • The documentation does not list any image-based formats (e.g., JPG, PNG, TIFF) as supported file types for knowledge ingestion. 
      • While PDF is listed as a supported format, it is highly likely that only text-based PDFs are processed correctly by the Agent Builder. 

    To address this limitation, you may consider the following steps: 

    1. Convert image-based PDFs to text-based PDFs using OCR tools. 
    2. Alternatively, convert the content into other supported text formats such as .txt, .docx, or .html. 

    Before re-uploading or re-indexing the files, please ensure the following: 

    • Permissions: Verify that Copilot has the appropriate access rights to the SharePoint folder and its contents. 
    • File Availability: Confirm that the files are marked as available for Copilot knowledge ingestion. 
    • Restricted SharePoint Search: As noted in the documentation:  “If Restricted SharePoint Search is enabled, you won't be able to use SharePoint as a knowledge source.” 

    This setting can prevent the agent from accessing any SharePoint-based content, regardless of file format or permissions, so it’s important to verify that this restriction is not enabled. You can read more about this setting via https://learn.microsoft.com/en-us/sharepoint/restricted-sharepoint-search  

    I hope these suggestions help, and I sincerely wish you the best in resolving this issue quickly. Please let me know if there's anything else I can assist you with. 


    If the answer is helpful, please click "Accept Answer" and kindly upvote it. If you have extra questions about this answer, please click "Comment".         

    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.


1 additional answer

Sort by: Most helpful
  1. Intranet AI 40 Reputation points
    2025-06-11T12:16:32.96+00:00

    I got a notification that you responded to the question but I can't see your answer here in the page, only the first snippet from the notification email.

    Edit: now I can see your reply

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.