Understanding the workings of the Azure OCR Custom Model.

Qadir Varawala 11 Reputation points
2021-08-18T08:27:21.317+00:00

Things we tried:
We currently train the model on some sample PDF files containing multiple tables and key-value pairs.

1) When we train the model well, with 100% accuracy, and then parse a new file that hasn't been trained on before, we get low confidence scores on the data from the new file. Some attributes aren't even mapped.

2) When we have similar-format files with different data, the accuracy of the model isn't 100%. We get better confidence scores, but still not in the 90s as we expect; the confidence scores range between 20% and 80%.

3) We then believed the way we were tagging the data was preventing better confidence scores. We renamed the tags as table1, table2, etc., hoping to get better accuracy while reading the data from the model. We still received lower confidence scores.

There are a few points I'd like to understand.

1) When we tag the data, is the model going to base future scans on the position of the data in the PDF, or on the actual content itself?

For example, if I tag the word 'Microsoft' in a document, will it look for the word 'Microsoft' in all pages of the PDF, or will it look for it in the location learned from the files that were trained?

2) What can we do to improve the confidence scores of files being scanned in the future?

3) The tables we have are not always at the same coordinates; their position varies based on the data they contain. We found it difficult to select the entire table, as a couple of columns would get skipped, so we tagged each of the values from the table individually. If a new file has more rows than what I have tagged in the model, will I be able to get the data from those additional rows?
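For context on the additional-rows question: table results come back from the service as a flat list of cells carrying row and column indices, not as a fixed set of tagged positions, so rows beyond those seen in training can still be reconstructed. A minimal sketch, with plain dicts standing in for the SDK's table-cell objects (the `row_index`/`column_index`/`text`/`confidence` shape mirrors the client library, but the data here is illustrative):

```python
# Simulated table cells as returned for one recognized table.
# These dicts stand in for the SDK's FormTableCell objects -- illustrative only.
cells = [
    {"row_index": 0, "column_index": 0, "text": "Item",   "confidence": 0.99},
    {"row_index": 0, "column_index": 1, "text": "Qty",    "confidence": 0.98},
    {"row_index": 1, "column_index": 0, "text": "Widget", "confidence": 0.87},
    {"row_index": 1, "column_index": 1, "text": "4",      "confidence": 0.91},
    {"row_index": 2, "column_index": 0, "text": "Gadget", "confidence": 0.66},
    {"row_index": 2, "column_index": 1, "text": "7",      "confidence": 0.72},
]

def rows_from_cells(cells):
    """Group flat cell results into ordered rows, however many rows arrive."""
    rows = {}
    for cell in cells:
        rows.setdefault(cell["row_index"], {})[cell["column_index"]] = cell["text"]
    # Emit rows in index order, columns in index order within each row.
    return [[row[c] for c in sorted(row)] for _, row in sorted(rows.items())]

table = rows_from_cells(cells)
```

Because the grouping is driven by the indices in the result rather than by tag positions, a file with ten rows yields ten entries in `table` even if the training samples only had three.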

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

2 answers

  1. Qadir Varawala 11 Reputation points
    2021-08-18T12:56:54.14+00:00

    Hi Ram,

    This is the tool we are currently using: https://fott.azurewebsites.net/#

    What is the latest version of the tool? I don't seem to be able to map separate sections of the page as different models.


  2. Ramr-msft 17,621 Reputation points
    2021-08-18T09:58:56.977+00:00

    @Qadir Varawala Thanks for the question. Can you please share the sample documents and the Form Recognizer version that you are using? Generally we start with 5 documents as a training set, and you should be able to add more documents to your training incrementally to see an improvement in the results. If you don't see improvements after doing that, we can forward this to the Form Recognizer team.
    There are a few ways you can do it here:
    • Create a model for each segment on the page. So if you have 3 segments, you would create 3 models and call the individual models.
    • Same as above, but create a composed model. The composed model will automatically classify your document and provide the data for the respective tag you're looking for.
    • Create one model and capture data points for each segment in a single tag or set of tags. After the data is extracted, you can use a function for post-processing to break down the values by segment.

    Please follow the document to improve the results.

    https://github.com/microsoft/knowledge-extraction-recipes-forms
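The third option above (one model, then post-processing by segment) can be sketched as follows. This is a minimal illustration, not the service's API: the tag names, the `segment_key` prefix convention, and the field dicts standing in for the SDK's recognized-field results are all assumptions for the example.

```python
# Simulated extracted fields, tagged with a segment prefix such as
# "header_" or "table1_". The dicts mimic recognized-field results
# (value + confidence) -- illustrative only.
extracted = {
    "header_invoiceno": {"value": "INV-1042", "confidence": 0.93},
    "header_date":      {"value": "2021-08-18", "confidence": 0.88},
    "table1_total":     {"value": "512.00", "confidence": 0.41},
}

def split_by_segment(fields, threshold=0.6):
    """Break tagged fields into per-segment groups and flag low-confidence values."""
    segments = {}
    for name, field in fields.items():
        segment, _, key = name.partition("_")  # "header_date" -> ("header", "date")
        segments.setdefault(segment, {})[key] = {
            "value": field["value"],
            "needs_review": field["confidence"] < threshold,
        }
    return segments

result = split_by_segment(extracted)
```

Flagging values below a confidence threshold this way lets low-scoring extractions (like the 20-80% scores described in the question) be routed to manual review instead of being consumed blindly.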