Building Form Recognizer with labels and output required in pdf

amitlalazure503 26 Reputation points
2021-03-08T16:49:00.193+00:00

Hello Members,

Started working Form Recognizer by building a training model for our custom pdfs sources.
And trying to extract the selective tables/images from those 100 paged PDF. To train this custom model, we uploaded 10 different PDF versions and now successfully receiving all outputs in JSON very well with a 90% + score on the training model.

Our baseline question => How to convert JSON output received from those PDFs to a similar PDF format?
Any inputs are highly appreciated.

Regards,
Amit Lal

Azure Form Recognizer
Azure Form Recognizer
An Azure service that applies machine learning to extract text, key/value pairs, tables, and structures from documents.
643 questions
{count} votes

2 answers

Sort by: Most helpful
  1. YutongTie-MSFT 24,466 Reputation points Microsoft Employee
    2021-03-08T23:19:51.63+00:00

    Hello Amit,

    Thanks for reaching out to us. There is no way to output pdf from form recognizer, but you can use logic apps to do it (Form recognizer as a part). There is a sample solution for you please feel free to refer to it.

    https://powerusers.microsoft.com/t5/Building-Flows/Extracting-PDF-data-with-Form-Recognizer-and-saving-it-to/td-p/429459

    And the document for Logic apps
    https://azure.microsoft.com/en-us/services/logic-apps/

    Regards,
    Yutong

    No comments

  2. amitlalazure503 26 Reputation points
    2021-03-10T15:29:29.853+00:00

    Hi Yutong,
    Thanks for your input. I understand Logic apps required here.
    The bigger question can form recognizer layout API able to fetch selective images and tables from pdf report? If yes, please share some insight/GitHub etc.

    Thank you,
    Amit Lal