Form Recognizer: how to analyze a multipage form that is already split into multiple files (one per page)

-FrancescoFucci- 1 Reputation point
2021-03-22T18:56:10.467+00:00

Hello, I have a form to analyze that is provided as three .jpg files, each corresponding to a single page of a multipage form. I was wondering whether it is possible to feed Form Recognizer these three separate files, check whether they match a label-based model, and extract data from them. Alternatively, I was considering merging the .jpg files into a single PDF with a Java library, but that would be a last resort. Thank you

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

1 answer

  1. Ramr-msft 17,611 Reputation points
    2021-03-23T13:23:10.19+00:00

    @-FrancescoFucci- Thanks. If you only need to extract text, tables, and selection marks from documents, use Layout. If you also need to extract key-value pairs, you can train a custom model or use a pre-built model (Invoice, Receipts, Business Cards).

    Layout results (text, tables and selection marks) are included in all the Analyze outputs (custom and pre-built) in the readResults (text) and pageResults (tables) of the JSON output.
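As a rough illustration of where those two sections sit, the sketch below walks an Analyze result that has already been parsed from JSON into a Python dict. The field names (`analyzeResult`, `readResults`, `lines`, `pageResults`, `tables`, `cells`, `rowIndex`, `columnIndex`) follow the v2.1 REST response layout as I understand it, so please check them against your own output. The `sample` payload is a hand-written fragment, not real service output, and the helper names `extract_lines` and `extract_tables` are hypothetical.

```python
from typing import Any, Dict, List


def extract_lines(result: Dict[str, Any]) -> Dict[int, List[str]]:
    """Map page number -> recognized text lines, read from readResults."""
    pages: Dict[int, List[str]] = {}
    for page in result.get("analyzeResult", {}).get("readResults", []):
        pages[page["page"]] = [line["text"] for line in page.get("lines", [])]
    return pages


def extract_tables(result: Dict[str, Any]) -> Dict[int, List[List[List[str]]]]:
    """Map page number -> tables, each rebuilt as a row-major grid of cell text."""
    tables_by_page: Dict[int, List[List[List[str]]]] = {}
    for page in result.get("analyzeResult", {}).get("pageResults", []):
        grids = []
        for table in page.get("tables", []):
            # Start from an empty grid; cells absent from the payload stay "".
            grid = [[""] * table["columns"] for _ in range(table["rows"])]
            for cell in table.get("cells", []):
                grid[cell["rowIndex"]][cell["columnIndex"]] = cell["text"]
            grids.append(grid)
        tables_by_page[page["page"]] = grids
    return tables_by_page


# Hand-written illustrative fragment (assumed v2.1 shape, not real output).
sample = {
    "status": "succeeded",
    "analyzeResult": {
        "readResults": [
            {"page": 1, "lines": [{"text": "Purchase Order"}, {"text": "Total 12.50"}]}
        ],
        "pageResults": [
            {
                "page": 1,
                "tables": [
                    {
                        "rows": 2,
                        "columns": 2,
                        "cells": [
                            {"rowIndex": 0, "columnIndex": 0, "text": "Item"},
                            {"rowIndex": 0, "columnIndex": 1, "text": "Price"},
                            {"rowIndex": 1, "columnIndex": 0, "text": "Pen"},
                            {"rowIndex": 1, "columnIndex": 1, "text": "12.50"},
                        ],
                    }
                ],
            }
        ],
    },
}

lines_by_page = extract_lines(sample)
tables_by_page = extract_tables(sample)
```

Because both helpers key their output by the `page` value, results from pages analyzed in separate calls could be kept apart or renumbered in post-processing.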

    • Layout – extracts text, tables, and selection marks; no training required
    • Pre-built – Invoice, Receipts, Business Cards – extracts values of interest from these types of documents
    • Custom – extracts key-value pairs using a model trained on your own documents
    All of the above will also include the text, tables and selection marks in the results.
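To make the three options concrete, here is a minimal sketch that maps each one to the Analyze URL it would be posted to. It assumes the v2.1 REST path scheme (`/formrecognizer/v2.1/...`), which may differ from the API version you are using. The function name `analyze_url`, the `kind` labels, and the example endpoint are all hypothetical, and the sketch only builds a string without calling the service.

```python
API_VERSION = "v2.1"  # assumption: adjust to the API version of your resource

# Hypothetical short labels mapped to the pre-built model path segments.
PREBUILT_SEGMENTS = {
    "invoice": "invoice",
    "receipt": "receipt",
    "business_card": "businessCard",
}


def analyze_url(endpoint: str, kind: str, model_id: str = "") -> str:
    """Build the Analyze URL for layout, a pre-built model, or a custom model."""
    base = f"{endpoint.rstrip('/')}/formrecognizer/{API_VERSION}"
    if kind == "layout":
        # Text, tables, and selection marks only; no training required.
        return f"{base}/layout/analyze"
    if kind in PREBUILT_SEGMENTS:
        # Values of interest from invoices, receipts, or business cards.
        return f"{base}/prebuilt/{PREBUILT_SEGMENTS[kind]}/analyze"
    if kind == "custom":
        # Key-value pairs from a model trained on your own documents.
        if not model_id:
            raise ValueError("a custom model requires model_id")
        return f"{base}/custom/models/{model_id}/analyze"
    raise ValueError(f"unknown kind: {kind!r}")


# Placeholder endpoint, for illustration only.
url = analyze_url("https://example.cognitiveservices.azure.com/", "layout")
```

Each page file would then be sent to the chosen URL in its own request, with the resource key supplied in the request headers.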

    Check out the Knowledge Extraction Recipes resource for post-processing approaches to different use cases: https://github.com/microsoft/knowledge-extraction-recipes-forms