Tips for labeling a handwritten scorecard Custom Extraction Model

Ercel Concepcion
2024-02-16T04:20:55.62+00:00

Hi! I am training a model to extract scores from a set of handwritten golf scorecards. After running the layout analysis, I notice that the bottom 2-3 rows don't get recognized as individual cells, while the first 2 rows are fine. This is a common occurrence across the 30+ scorecards I have labeled. [User's image]

Below is the result after running the analysis with the model I created. The 4th and 5th rows were not recognized, although they look the same as the first 2 rows. [User's image]
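To make the symptom easier to pin down per card, here is a minimal sketch of how rows without individually recognized cells could be listed from the layout output. It assumes each table is available as a dict with `rowCount`, `columnCount`, and a `cells` list whose items carry `rowIndex`, `columnIndex`, and `content`; those field names, the `rows_missing_cells` helper, and the sample data are illustrative assumptions, not output from the scorecards above. Merged cells (spans) are ignored here.

```python
from collections import defaultdict


def rows_missing_cells(table):
    """Return {row_index: [missing column indexes]} for one layout table.

    Assumes `table` is a dict with `rowCount`, `columnCount`, and a `cells`
    list whose items have `rowIndex`, `columnIndex`, and `content`.
    Row and column spans are not taken into account.
    """
    # Collect, per row, the columns that hold a non-empty cell.
    filled = defaultdict(set)
    for cell in table["cells"]:
        if cell.get("content", "").strip():
            filled[cell["rowIndex"]].add(cell["columnIndex"])

    # Any column index absent from a row counts as a gap in that row.
    missing = {}
    for row in range(table["rowCount"]):
        gaps = [c for c in range(table["columnCount"]) if c not in filled[row]]
        if gaps:
            missing[row] = gaps
    return missing


# Hypothetical table: the last row was read as one cell instead of three.
sample_table = {
    "rowCount": 3,
    "columnCount": 3,
    "cells": [
        {"rowIndex": 0, "columnIndex": 0, "content": "4"},
        {"rowIndex": 0, "columnIndex": 1, "content": "5"},
        {"rowIndex": 0, "columnIndex": 2, "content": "3"},
        {"rowIndex": 1, "columnIndex": 0, "content": "4"},
        {"rowIndex": 1, "columnIndex": 1, "content": "4"},
        {"rowIndex": 1, "columnIndex": 2, "content": "5"},
        {"rowIndex": 2, "columnIndex": 0, "content": "5 4 6"},
    ],
}

print(rows_missing_cells(sample_table))
```

Running a check like this over every card would show whether the same rows fail each time, which can help decide which rows need extra labeling attention.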

Do you have any suggestions on how to improve this result? Should I use more training data? Is it better to use a region when labeling a table? Is it better to use another format, such as PDF or JPG? I am currently using PNG files. The files are photos of the cards, not scans, although the image quality is good (about 2 MB per file). Does that affect the outcome? Thank you in advance!

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.