Form Recognizer: how to analyze a form already split in multiple files (each per page) from the same multipage form

Question

Hello, I have a form to analyze that was provided as three .jpg files, each corresponding to a single page of a multipage form. I was wondering whether it is possible to feed these three separate files to Form Recognizer, check whether they match a label-based model, and extract data from them. Alternatively, I was considering merging the .jpg files into a single PDF with a Java library, but that would be a last resort. Thank you.

Answer

@-FrancescoFucci- Thanks. If you are extracting only text, tables, and selection marks from documents, you should use Layout. If you also need to extract key-value pairs, you can train a custom model or use a pre-built one (Invoice, Receipts, Business Cards).

Layout results (text, tables and selection marks) are included in all the Analyze outputs (custom and pre-built) in the readResults (text) and pageResults (tables) of the JSON output.
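As a rough illustration of where those layout results sit, the sketch below walks a trimmed, hand-written response and collects the text lines and table cells per page. The sample JSON is invented for illustration and only assumes the v2.1 response shape (`analyzeResult.readResults[].lines[].text` and `analyzeResult.pageResults[].tables[].cells[]`); the helper name `extract_layout` is hypothetical, and the field names should be checked against the actual output of your resource.

```python
import json

# Hand-written, heavily trimmed sample; real responses carry many more fields.
SAMPLE = """
{
  "status": "succeeded",
  "analyzeResult": {
    "readResults": [
      {"page": 1, "lines": [{"text": "Name: Mario Rossi"}, {"text": "Date: 2021-03-01"}]}
    ],
    "pageResults": [
      {"page": 1, "tables": [
        {"rows": 1, "columns": 2, "cells": [
          {"rowIndex": 0, "columnIndex": 0, "text": "Item"},
          {"rowIndex": 0, "columnIndex": 1, "text": "Qty"}
        ]}
      ]}
    ]
  }
}
"""


def extract_layout(response):
    """Return ({page: [line text]}, {page: [table as a list of rows]})."""
    result = response.get("analyzeResult", {})

    # Text lines live under readResults, one entry per page.
    lines = {}
    for page in result.get("readResults", []):
        lines[page["page"]] = [line["text"] for line in page.get("lines", [])]

    # Tables live under pageResults; rebuild each one as a rows x columns grid.
    tables = {}
    for page in result.get("pageResults", []):
        page_tables = []
        for table in page.get("tables", []):
            grid = [[""] * table["columns"] for _ in range(table["rows"])]
            for cell in table.get("cells", []):
                grid[cell["rowIndex"]][cell["columnIndex"]] = cell["text"]
            page_tables.append(grid)
        tables[page["page"]] = page_tables

    return lines, tables


lines, tables = extract_layout(json.loads(SAMPLE))
print(lines[1])
print(tables[1][0])
```

The same function could be run on the output of each page file analyzed separately, since the results are already keyed by page.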

• Layout – extracts text, tables, and selection marks; no training required
• Pre-built – Invoice, Receipts, Business Cards – extracts values of interest from these types of documents
• Custom – extracts key-value pairs; trained on your own documents
All of the above also include the text, tables, and selection marks in the results.
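To make the three options concrete, here is a minimal sketch that maps each one to an Analyze request path. It assumes the v2.1 REST route layout (`layout/analyze`, `prebuilt/<type>/analyze`, `custom/models/<id>/analyze`); the function name `analyze_path` and the model ID used below are made up for illustration, so verify the paths against the API version your resource exposes.

```python
PREBUILT_KINDS = ("invoice", "receipt", "businessCard")


def analyze_path(kind, model_id=None, version="v2.1"):
    """Build the Analyze request path for a model kind (assumed v2.1 routes)."""
    base = f"/formrecognizer/{version}"
    if kind == "layout":
        # No training required: text, tables, and selection marks only.
        return f"{base}/layout/analyze"
    if kind in PREBUILT_KINDS:
        # Pre-built models for specific document types.
        return f"{base}/prebuilt/{kind}/analyze"
    if kind == "custom":
        # Custom models are addressed by the ID returned at training time.
        if not model_id:
            raise ValueError("a custom model needs a model ID")
        return f"{base}/custom/models/{model_id}/analyze"
    raise ValueError(f"unknown model kind: {kind}")


print(analyze_path("layout"))
print(analyze_path("invoice"))
print(analyze_path("custom", model_id="my-model-id"))  # placeholder ID
```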

Check out the Knowledge Extraction Recipes resource for different post-processing use cases: https://github.com/microsoft/knowledge-extraction-recipes-forms
