How to use Azure Cognitive Services to create different PDFs based on types from a single PDF?

Snehal Shah 0 Reputation points
2023-07-25T13:26:25.8066667+00:00

Hi experts,

I have several multi-page PDFs from multiple companies, each contains bank statements, purchase invoices from various suppliers and sales invoices. I would like to segregate each type into a separate PDF using Azure Cognitive Services. Can you please suggest the logic and services required (such as Form Recognizer) to achieve this task?

Thanks

Azure AI Search
Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
1,339 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,598 questions
{count} votes

3 answers

Sort by: Most helpful
  1. Azar 29,520 Reputation points MVP Volunteer Moderator
    2023-07-25T15:52:32.6566667+00:00

    Hi @Snehal Shah

    follow these steps to segregate each type into a separate PDF using Azure Cognitive Services

    1. Use Azure Form Recognizer to extract text and key-value pairs from PDFs.
    2. Implement document classification logic based on the extracted data.
    3. Split the PDFs into separate files for each document type.
    4. Save the segregated PDFs into separate folders.

    Documentation link for Azure Form Recognizer:

    if this helps kindly accept the asnwer, for assiatance ping here thanks much


  2. Grmacjon-MSFT 19,151 Reputation points Moderator
    2023-07-27T23:27:31.37+00:00

    Hi @Snehal Shah ,

    Adding to Azar's response - I want to point out that Azure Form Recognizer is now called "Azure AI Document Intelligence ".

    For your scenario, consider using: Document Intelligence layout model

    Document Intelligence layout model is an advanced machine-learning based document analysis API available in the Document Intelligence cloud. It enables you to take documents in various formats and return structured data representations of the documents. It combines an enhanced version of our powerful Optical Character Recognition (OCR) capabilities with deep learning models to extract text, tables, selection marks, and document structure.

    Hope that helps

    Best,

    Grace

    0 comments No comments

  3. Kalyan Chakravarthi Bondala 0 Reputation points
    2025-04-16T17:27:00.19+00:00

    Hi @Azar Azar
    I have a use case when I have pdf with 20 purchase orders, each purchase orders has a table of linetimes(description, externalItemid, quantity), also for each purchase order I have fields like shipto address, PO Number etc.,
    How do I Extract those orders.

    1. Can I Split the PDF into individua orders then train.
    2. Is there a way to return list of orders through JSON.

    Have researched, but did not find a solution Yet.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.