Building Form Recognizer with labels and output required in pdf

amitlalazure503 26

Hello Members,

Started working Form Recognizer by building a training model for our custom pdfs sources.
And trying to extract the selective tables/images from those 100 paged PDF. To train this custom model, we uploaded 10 different PDF versions and now successfully receiving all outputs in JSON very well with a 90% + score on the training model.

Our baseline question => How to convert JSON output received from those PDFs to a similar PDF format?
Any inputs are highly appreciated.

Regards,
Amit Lal

YutongTie-MSFT 52,956 Reputation points

2021-03-09T06:25:02.923+00:00

Please let me know if you have any question according to it. Thanks.

2 answers

YutongTie-MSFT 52,956 Reputation points

2021-03-08T23:19:51.63+00:00

Hello Amit,

Thanks for reaching out to us. There is no way to output pdf from form recognizer, but you can use logic apps to do it (Form recognizer as a part). There is a sample solution for you please feel free to refer to it.

https://powerusers.microsoft.com/t5/Building-Flows/Extracting-PDF-data-with-Form-Recognizer-and-saving-it-to/td-p/429459

And the document for Logic apps
https://azure.microsoft.com/en-us/services/logic-apps/

Regards,
Yutong
Please sign in to rate this answer.

0 comments No comments
Sign in to comment

Use comments to ask for clarification, additional information, or improvements to the question.
amitlalazure503 26 Reputation points

2021-03-10T15:29:29.853+00:00

Hi Yutong,
Thanks for your input. I understand Logic apps required here.
The bigger question can form recognizer layout API able to fetch selective images and tables from pdf report? If yes, please share some insight/GitHub etc.

Thank you,
Amit Lal
Please sign in to rate this answer.
YutongTie-MSFT 52,956 Reputation points

2021-03-16T16:47:00.75+00:00

Hello Amit,

Thanks for the response. For tables, yes, please refer to the document: https://learn.microsoft.com/en-us/azure/cognitive-services/form-recognizer/concept-layout?

For image, I am checking internal to see if there any raodmap here.

Regards,
Yutong

amitlalazure503 26 Reputation points

2021-03-23T21:15:43.277+00:00

Hello Yutong, Thanks for your inputs. Any Github ref. for Tables fetching, that should help.
Perhaps, I'll wait for your inputs on the image fetching. Thank you,
Amit

YutongTie-MSFT 52,956 Reputation points

2021-03-24T08:29:35.967+00:00

Hello,

What's kind of GitHub Reference you want? I think below two are enough. The first one is the introduce and the second one is the API reference.

https://learn.microsoft.com/en-us/azure/cognitive-services/form-recognizer/concept-layout#tables

https://westcentralus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1-preview-3/operations/AnalyzeLayoutAsync

Regards,
Yutong

amitlalazure503 26 Reputation points

2021-03-27T15:08:10.927+00:00

Thanks for your response again. I'm looking for selective images back on output as well. You mentioned checking internally and roadmap etc. Hence curious if that is available now or in the future.
Thank you,
Amit
Sign in to comment

Use comments to ask for clarification, additional information, or improvements to the question.

Share via

Building Form Recognizer with labels and output required in pdf

2 answers

Your answer