Extract Table data using Azure form Recognizer

Akshay Godase 1 Reputation point
2020-09-15T06:09:37.123+00:00

Hello,

I am working on an invoice processing project using Azure From Recognizer. All the invoices are in PDF format. I am using custom form recognizer with labelling. I can extract some data from PDF like Invoice No, Invoice Date, Amount, etc., but I want to extract table data from the pdf using Azure Form Recognizer, but it is not reading the table correctly.

I have labelled the cells which I need and when the number of rows in the table increases it reads the column correctly, but it is unable to separate the values of each row from each other and returns the whole column as a single value.

I tried to provide more examples, but it is still failing to detect the correct table. Is there any way to extract table data properly from PDF using Azure Form Recognizer?

Scanning the table is an essential requirement for our application, and it will decide if we base our application using Azure Form Recognizer or not.

Please see the below PDF table image and want to extract all row data from all columns.

24765-image.png

If you can point us in the right direction with some documentation on this, then it would be beneficial.

Thanks

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,328 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,340 questions
{count} votes