Read OCR Handwritten text extraction

Question

Read OCR Handwritten text extraction

AzureUser-9588 151

Is it possible to increase the accuracy of identification and extraction of Handwritten text using Azure AI Document Intelligence? The default Read OCR handwritten text extraction for the set of documents that I am using are not satisfactory.

Deepanshu katara 16,720 Reputation points MVP Moderator

2024-02-05T06:25:57.6533333+00:00

Hi , Yes you can do that by using Azure Form Recognizer . Please check below link having all details https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-form-recognizer-is-now-azure-ai-document-intelligence-with/ba-p/3875765

1 answer

Your answer

Deepanshu katara 16,720 Reputation points MVP Moderator

2024-02-05T06:25:57.6533333+00:00

Hi , Yes you can do that by using Azure Form Recognizer . Please check below link having all details https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/azure-form-recognizer-is-now-azure-ai-document-intelligence-with/ba-p/3875765

Answer 1

@AzureUser-9588 Welcome to Microsoft Q&A Forum, Thank you for posting your query here!

Azure AI Document Intelligence Read Optical Character Recognition (OCR) model runs at a higher resolution than Azure AI Vision Read and extracts print and handwritten text from PDF documents and scanned images.

The Read OCR model / document layout model extracts print and handwritten style text as lines and words. This feature applies to supported handwritten languages. Please check if your concerned language is under the supported list.

To extract printed and handwritten text along with barcodes, formulas and font styles from images and documents:

Read model DI studio link: https://documentintelligence.ai.azure.com/studio/read

Layout model DI studio link: https://documentintelligence.ai.azure.com/studio/layout

If you have already tried the above and feel that the identification and accuracy needs to be improved then follow the below:

Action Plan: It is possible to improve the accuracy of handwritten text extraction using Azure AI Document Intelligence. Here are some strategies you can consider:

Custom Models: Custom models generate an estimated accuracy score when trained. Documents analyzed with a custom model produce a confidence score for extracted fields. You can use these scores to interpret the accuracy and improve the results.
Confidence Scores: Document Intelligence analysis results return an estimated confidence for predicted words, key-value pairs, selection marks, regions, and signatures. You can use these confidence scores to determine whether to automatically accept the prediction or flag it for human review.
Training Data: Ensure that all variations of a document are included in the training dataset. This can help produce a model with higher accuracy and confidence scores during analysis and reduce the number of documents flagged for human review.

More Info about this is here. Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help. ** Please do not forget to "Accept the answer” and “up-vote” wherever the information provided helps you, this can be beneficial to other community members.

AzureUser-9588 151 Reputation points

2024-02-05T09:47:45.65+00:00

Does using a custom model can improve the Read OCR Handwritten text character recognition (with AI Document Intelligence) ? Because, in my case certain handwritten characters from a scanned documents/images are not accurately getting identified. For example - a as e, j as i or g etc. Didn't find any articles with combination of custom model and handwritten text accuracy.
navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-02-05T10:07:07.05+00:00

@AzureUser-9588 Thanks for your reply. Yes, a custom model can improve accuracy for your scenario. Regarding the custom model creation refer the below 2 articles:

https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/quickstarts/try-document-intelligence-studio?view=doc-intel-4.0.0#custom-models

https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept-accuracy-confidence?view=doc-intel-4.0.0

Also, did you explore the OCR functionality of Azure AI Computer Vision service to get the handwritten text from images ? Please see this. Please check and let me know if that is giving better accuracy.

Share via

Read OCR Handwritten text extraction

1 answer

Your answer