@Acharya, Rakesh Thanks for the question, Azure Cognitive Services provides Industry's best optical character recognition (OCR) capability with Read API. The Computer Vision Read API is Azure's latest OCR technology (learn what's new) that extracts printed text (in several languages), handwritten text (English only), digits, and currency symbols from images and multi-page PDF documents. It's optimized to extract text from text-heavy images and multi-page PDF documents with mixed languages. If possible can you please share the sample input images and the output that is unable to extract data.
Also we have built a form recognition service seems promising for your application. Can you please try with the Form Recognizer Layout API that Detects and extracts text and layout of documents.
https://azure.microsoft.com/en-us/services/cognitive-services/form-recognizer/
In the following outlines the traditional challenges of doing OCR in the wild, and what are the ways in which deep learning algorithms are being applied to transform these solutions.
https://twimlai.com/how-deep-learning-has-revolutionized-ocr-with-cha-zhang/
Resources
• Computer Vision
• Microsoft Form Recognizer
• Paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
• Paper: LayoutLM: Pre-training of Text and Layout for Document Image Understanding