Share via

how to convert a pdf to text

Reuben, Jacqueline 0 Reputation points
2026-03-19T15:39:24.3166667+00:00

I have a large pdf document that I would like to convert to text to more easily navigate the sections and headers.

Microsoft 365 and Office | Other
0 comments No comments

1 answer

Sort by: Most helpful
  1. Q&A Assist
    2026-03-19T15:39:40.0533333+00:00
    AI generated content. This question contains an answer created with AI from Q&A Assist. Learn more

    In Microsoft 365, there are two main ways to get text out of a PDF so it’s easier to navigate and work with.

    1. Convert the PDF to an editable Word document

    This is usually the most straightforward way to get structured text (headings, paragraphs) from a PDF:

    1. Open Word (desktop or Microsoft365.com).
    2. Go to File > Open.
    3. Browse to and select the PDF file.
    4. When prompted that Word will make a copy and convert the PDF to an editable Word document, select OK.
    5. After conversion, the content will be in Word format. You can:
      • Use Navigation Pane (View > Navigation Pane) to move through headings.
      • Edit, search, and reorganize text as needed.
    6. If needed, save as:
      • Word document: File > Save As, choose Word Document (.docx).
      • Plain text: File > Save As, choose Plain Text (.txt).

    Notes:

    • This works best for PDFs that are mostly text. Complex layouts, scanned pages, or copier-like manuscripts may not convert cleanly, and page/line breaks may differ from the original.
    1. Extract text from PDF using Power Automate for desktop

    If using Power Automate for desktop and want to programmatically extract text (for example, from a large PDF or many PDFs):

    1. In Power Automate for desktop, create or edit a desktop flow.
    2. Add the Extract text from PDF action.
    3. Configure:
      • PDF file: Path to the PDF.
      • Page(s) to extract: All, Single, or Range.
      • If needed, set Password for protected PDFs.
      • Optionally enable Optimize for structured data to better detect formatted layout.
    4. Run the flow. The action outputs a variable, typically ExtractedPDFText, containing the extracted text.
    5. Use that text variable to write to a file, process further, or load into another system.

    This approach is useful when automating extraction from large or multiple PDFs.


    References:

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.