Arabic Support for Form Recogniser

Shwetha Kumari Anantha 1 Reputation point
2021-05-11T05:28:06.917+00:00

Does Form Recognizer support Arabic text/table extraction?

If not is this on the roadmap?

I see that I am able to extract content from Arabic Digital PDF's. But the output for Scanned Arabic PDF's are worse. The Arabic text is returned in gibberish English and the tables are not extracted at all.

Would appreciate a quick answer.

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,199 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. svijay-MSFT 5,201 Reputation points Microsoft Employee
    2021-05-11T06:46:15.237+00:00

    Hello @Shwetha Kumari Anantha ,

    Thanks for your question. Currently, Form Recognizer doesn't support Arabic Language.

    For the list of supported languages you could refer the below article.

    https://learn.microsoft.com/en-us/azure/cognitive-services/form-recognizer/language-support

    Unfortunately, there is no timeline for the Arabic support at this point of time. Having said that, I would recommend you to voice out your requirement here - Azure Feedback - Cognitive Services. This is where the Product Groups look for features to add.

    1 person found this answer helpful.
    0 comments No comments

  2. Shwetha Kumari Anantha 1 Reputation point
    2021-05-13T04:01:21.76+00:00

    Thank you @svijay-MSFT for the reply.

    Have submitted the idea in the Azure feedback site as suggested.

    People who find this feature necessary please do vote for the idea.

    Link:
    https://feedback.azure.com/forums/932041-azure-cognitive-services/suggestions/43396797-arabic-language-support-for-form-recognizer

    0 comments No comments