Improve the performance of vertical text recognition for Form Recognizer Read API

Kenrick Fernandes 5 Reputation points
2023-07-19T21:21:32.99+00:00

Hello,

We have been working on a computer vision use case that involves performing OCR on images using Form Recognizer. The images contain both horizontal and vertical alphanumeric text. We have tested the Form Recognizer Read API extensively and found that it underperforms at detecting and recognizing vertical text, especially when the text is alphanumeric. It looks like the Read models were not trained on vertical alphanumeric text; across all our vertical text samples, we have not yet seen it recognize a single letter.
Attaching a few samples for your reference.

MicrosoftTeams-image (5)

MicrosoftTeams-image (7)

MicrosoftTeams-image (6)

Do you have any ideas on how we can improve the recognition of vertical text?

Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

1 answer

  1. VasaviLankipalle-MSFT 18,676 Reputation points Moderator
    2023-07-20T05:45:57.9966667+00:00

    Hello @Kenrick Fernandes, welcome to the Microsoft Q&A Platform.

    Thank you for providing the detailed information and bringing this to our notice. I just reproduced this using the latest preview API version, and yes, it looks like the Form Recognizer Read API is not extracting the text accurately. I will share this feedback with the product team.

    If your scenario involves extracting text from labels, street signs, and posters, please use the Azure AI Vision v4.0 preview Read feature, as mentioned here: https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept-read?view=doc-intel-3.0.0

    For extracting text from external images like labels, street signs, and posters, use the Azure AI Vision v4.0 preview Read feature optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in your user experience scenarios.

    You can also try a custom model in Form Recognizer Studio and see if it helps.
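
    As a rough illustration of the Azure AI Vision v4.0 preview Read suggestion, the sketch below builds a synchronous Image Analysis request using only the Python standard library. The API version string (2023-02-01-preview), the features=read query parameter, and the helper names build_read_request and analyze_image are assumptions for illustration and are not taken from this thread; please check them against the current REST reference before relying on them.

```python
import json
import urllib.parse
import urllib.request

# Assumption: preview API version for Image Analysis 4.0; verify in the docs.
API_VERSION = "2023-02-01-preview"


def build_read_request(endpoint: str, key: str, image_bytes: bytes) -> urllib.request.Request:
    """Build (but do not send) a request that asks only for the Read (OCR) feature."""
    query = urllib.parse.urlencode({"api-version": API_VERSION, "features": "read"})
    url = f"{endpoint.rstrip('/')}/computervision/imageanalysis:analyze?{query}"
    return urllib.request.Request(
        url,
        data=image_bytes,  # raw image bytes sent as the request body
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/octet-stream",
        },
    )


def analyze_image(endpoint: str, key: str, image_path: str) -> dict:
    """Send one image to the service and return the parsed JSON response."""
    with open(image_path, "rb") as f:
        request = build_read_request(endpoint, key, f.read())
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

    Calling analyze_image with your own endpoint, key, and one of the vertical text samples returns the service's JSON response, which you can inspect to see whether the vertical lines are recognized any better than with the Form Recognizer Read API.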

    I hope this helps.

    Regards,
    Vasavi

    - Please accept the answer and vote 'yes' if you found it helpful, to support the community. Thanks.
