Vision Studio vs Document Intelligence Studio OCR

sami jan 75 Reputation points
2024-01-25T20:04:23.57+00:00

Hi I just ran the same image for OCR in both Vision Studio and Document Intelligence Studio and there is a big difference in the OCR capabilities b/w both - the image is of an industry schematic of a piping diagram Document Intelligence Studio captures some 99% of the text correct but Vision Studio captures only 90% or so - this happens with the Python API as well If I cut up the same image into smaller pieces e.g. 4 equal parts, Vision Studio quality improves dramatically Is that something that can be optimized? Can I use Vision API and specify some params that will help improve the quality on the larger image itself? BR

Azure Computer Vision
Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
329 questions
Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,462 questions
0 comments No comments
{count} votes

Accepted answer
  1. VasaviLankipalle-MSFT 15,241 Reputation points
    2024-01-26T02:03:24.16+00:00

    Hello @sami jan , Thanks for using Microsoft Q&A Platform.

    As we know, Document Intelligence Read Optical Character Recognition (OCR) model runs at a higher resolution than Azure AI Vision Read and extracts print and handwritten text from PDF documents and scanned images.

    My suggestion is to try adjusting the resolution of the image to improve OCR accuracy. Generally, higher resolution images generally produce better OCR results. Document scan quality, resolution, contrast, light conditions, rotation, and text attributes such as size, color, and density can all affect the accuracy of OCR results.

    However, it totally depends on your use case, if you are looking to improve the OCR Vision API performance, I would suggest you check these best practices to improve system performance: https://learn.microsoft.com/en-us/legal/cognitive-services/computer-vision/ocr-characteristics-and-limitations?context=%2Fazure%2Fai-services%2Fcomputer-vision%2Fcontext%2Fcontext&view=doc-intel-4.0.0#system-limitations-and-best-practices-to-improve-system-performance

    I hope this helps.

    Regards,

    Vasavi

    -Please kindly accept the answer and vote 'yes' if you feel helpful to support the community, thanks.

    1 person found this answer helpful.
    0 comments No comments

0 additional answers

Sort by: Most helpful