Form Recognizer Handle Multiple of Same Form in PDF

Question

I'm having good success with Form Recognizer, but I don't understand how pages work. I have a one-page form, and I want to let my users upload a PDF containing multiple versions or scans of this form. The service lets me submit a two-page document (the same form with different values on page 1 and page 2) and detects multiple pages, but I only get the values from the first page/form. I'm considering using a tool to split the PDF into separate pages, but I'm wondering whether this is necessary. It would be great if Form Recognizer could analyze all the pages as unique copies of the form so that I could parse the results.

Accepted Answer

@Eden Corbin Thanks for the question. This feature does not currently exist, and we have forwarded this feedback to our product team. You can also raise a user voice request here so the community can vote and provide feedback; the product team then checks this feedback and implements the feature in future releases.

For input requirements, see:
https://learn.microsoft.com/en-us/azure/cognitive-services/form-recognizer/overview?tabs=v2-1#input-requirements

Answer

I've hit the same issue as the original poster. We've got a lot of multi-page tables where the header labels are at the top of the first page, followed by a table that continues onto a second or third page. The column headers are the same on the continuation pages, but the values obviously change. It's a lot of extra work to split out the pages, call the API once per page, and then recombine the outputs into one output file that represents the original input data.
It would be good, when training the model, to have an option to 'add more rows from next page' or something similar, so that all the table data can be extracted together for a multi-page PDF table.
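
As a rough illustration of the recombination step described above, here is a minimal sketch in Python. It assumes each per-page result has already been reduced to a simple dictionary with "headers" and "rows" keys; that shape and the name merge_page_tables are hypothetical and are not the Form Recognizer response format.

```python
from typing import Dict, List


def merge_page_tables(page_tables: List[Dict[str, list]]) -> Dict[str, list]:
    """Concatenate rows from per-page tables that share the same column headers.

    Each input table is assumed (hypothetically) to look like
    {"headers": ["Item", "Qty"], "rows": [["A", "1"], ...]}.
    """
    if not page_tables:
        return {"headers": [], "rows": []}

    headers = list(page_tables[0]["headers"])
    merged_rows = []
    for page_number, table in enumerate(page_tables, start=1):
        if list(table["headers"]) != headers:
            raise ValueError(f"page {page_number} has different column headers")
        for row in table["rows"]:
            # Continuation pages may repeat the header line as a data row; skip it.
            if list(row) == headers:
                continue
            merged_rows.append(list(row))
    return {"headers": headers, "rows": merged_rows}
```

Raising an error on mismatched headers is a deliberate choice here: it stops a page holding a different table from being silently appended to the wrong one.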

Answer

I have the same problem; this feature would be highly desirable.

Answer

Looking for the same thing.

Answer

If you know there is one form on each page, you can simply use a PDF splitter and recognize the pages individually.

But it would be nice to have that split functionality built into Form Recognizer :)
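
A minimal sketch of that split-then-recognize approach in Python, assuming the third-party pypdf package is installed. The analyze_form argument is a hypothetical placeholder for whatever function sends one document to Form Recognizer and returns its extracted fields; it is not part of any SDK.

```python
import io
from typing import Callable, Dict, List


def split_pdf_pages(pdf_bytes: bytes) -> List[bytes]:
    """Return one single-page PDF (as bytes) for each page of the input PDF."""
    # Imported here so the helper below stays usable without pypdf installed.
    from pypdf import PdfReader, PdfWriter

    reader = PdfReader(io.BytesIO(pdf_bytes))
    single_pages = []
    for page in reader.pages:
        writer = PdfWriter()
        writer.add_page(page)
        buffer = io.BytesIO()
        writer.write(buffer)
        single_pages.append(buffer.getvalue())
    return single_pages


def analyze_each_page(
    single_pages: List[bytes],
    analyze_form: Callable[[bytes], Dict],
) -> List[Dict]:
    """Run analyze_form on every single-page PDF and tag each result with its page number.

    analyze_form is a caller-supplied (hypothetical) function that submits one
    document to the recognizer and returns the extracted fields.
    """
    results = []
    for page_number, page_bytes in enumerate(single_pages, start=1):
        results.append({"page": page_number, "fields": analyze_form(page_bytes)})
    return results
```

Usage would look like analyze_each_page(split_pdf_pages(uploaded_bytes), my_analyze), where uploaded_bytes and my_analyze are your own upload data and recognizer call. This only fits when every page holds exactly one copy of the form.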
