Vision Studio vs Document Intelligence Studio OCR

Question

Hi, I just ran the same image through OCR in both Vision Studio and Document Intelligence Studio, and there is a big difference in OCR quality between the two. The image is an industrial schematic of a piping diagram. Document Intelligence Studio captures about 99% of the text correctly, but Vision Studio captures only about 90%. This happens with the Python API as well. If I cut the same image into smaller pieces, e.g. 4 equal parts, Vision Studio's quality improves dramatically. Is that something that can be optimized? Can I use the Vision API and specify some parameters that will help improve quality on the larger image itself? BR

Accepted Answer

Hello @sami jan, thanks for using the Microsoft Q&A platform.

The Document Intelligence Read Optical Character Recognition (OCR) model runs at a higher resolution than Azure AI Vision Read and extracts printed and handwritten text from PDF documents and scanned images. That may explain the difference you are seeing on a dense schematic.

My suggestion is to try adjusting the resolution of the image to improve OCR accuracy. Higher-resolution images generally produce better OCR results. Scan quality, resolution, contrast, lighting conditions, rotation, and text attributes such as size, color, and density can all affect the accuracy of OCR results.

However, it depends on your use case. If you are looking to improve the Vision API's OCR performance, I would suggest checking these best practices for improving system performance: https://learn.microsoft.com/en-us/legal/cognitive-services/computer-vision/ocr-characteristics-and-limitations?context=%2Fazure%2Fai-services%2Fcomputer-vision%2Fcontext%2Fcontext&view=doc-intel-4.0.0#system-limitations-and-best-practices-to-improve-system-performance
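
The tiling workaround described in the question fits the resolution point above, since each tile gives the text more pixels relative to the image size. It is a client-side step, not a Vision API parameter. Below is a minimal sketch of how the crop boxes could be computed, with a small overlap so that text on a cut line is not split. The names `tile_boxes` and `to_full_image` and the default `overlap` of 32 pixels are assumptions for illustration, not part of any Azure SDK.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def tile_boxes(width: int, height: int, rows: int = 2, cols: int = 2,
               overlap: int = 32) -> List[Box]:
    """Split a width x height image into rows x cols crop boxes.

    Each box is extended by `overlap` pixels past its grid cell (clamped
    to the image bounds) so that text sitting on a cut line appears whole
    in at least one tile.
    """
    if rows < 1 or cols < 1 or overlap < 0:
        raise ValueError("rows and cols must be >= 1 and overlap >= 0")
    boxes = []
    for r in range(rows):
        for c in range(cols):
            left = max(c * width // cols - overlap, 0)
            top = max(r * height // rows - overlap, 0)
            right = min((c + 1) * width // cols + overlap, width)
            bottom = min((r + 1) * height // rows + overlap, height)
            boxes.append((left, top, right, bottom))
    return boxes


def to_full_image(x: int, y: int, box: Box) -> Tuple[int, int]:
    """Map a point reported inside a tile back to full-image coordinates."""
    return x + box[0], y + box[1]
```

Each box uses the (left, top, right, bottom) order that Pillow's `Image.crop` accepts, so the tiles can be cropped with an image library and sent to the OCR call one at a time. Text inside the overlap strips may be detected in two tiles, so results would need de-duplicating after mapping coordinates back with `to_full_image`.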

I hope this helps.

Regards,

Vasavi

