How to Enhance Azure OCR Detection in Arabic Data

Mayar Alzerki 20 Reputation points
2024-02-28T14:42:45.1233333+00:00

Hello everyone, I'm using Azure Computer Vision service to detect Arabic text in images with table structure. In some cases, the results I'm getting are not accurate.

As you can see in the attached image, the Arabic dates (right column) are not being detected, and the header is being considered as a one cell.

How can I solve this? Knowing that I'm using Img2Table library in python to detect the text in the cells as one piece of text.

User's image

Thank you!

Azure Computer Vision
Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
311 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,386 questions
0 comments No comments
{count} votes

Accepted answer
  1. dupammi 6,480 Reputation points Microsoft Vendor
    2024-02-28T16:35:26.6+00:00

    Hi @Mayar Alzerki

    Thank you for using the Microsoft Q&A forum.

    The results of the OCR API as mostly based on the quality of the image and the requirements should confer to these pre-requisites.

    To enhance Azure OCR detection in Arabic data, you can try using the OCR API's language, detectOrientation to detect OcrRegion, textAngle, and languageDetectionMode parameters. Additionally, you can review the Img2Table library documentation and Azure OCR API documentation to ensure they are being used correctly.

    If the issue persists, you can try preprocessing the image or using Custom Vision to train the models to recognize specific objects or text, and eventually improve the accuracy of the OCR service.

    Hope this helps.


    Please do not forget to click Accept Answer and Yes for was this answer helpful, wherever the information provided helps you. This can be beneficial to other community members.

    0 comments No comments

0 additional answers

Sort by: Most helpful