OCR Stacked Fractions - Azure Document Intelligence Layout

Tom Tucker 20 Reputation points
2023-09-22T17:14:30.5733333+00:00

While using Layout in the Document Intelligence Studio the OCR on a table does not recognize fractions. Is there a way to customize the OCR to recognize the stacked fractions and convert them to a number slash number like 1/4, 3/8, 7/8?

Original pdf tableImage

Generated table from LayoutImage

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,718 questions
{count} votes

Accepted answer
  1. romungi-MSFT 46,831 Reputation points Microsoft Employee
    2023-09-25T12:09:16.9666667+00:00

    @Tom Tucker The Read & Layout models support formula extraction but, in this case, the stacked fractions are still not being recognized. So, I don't think it is possible to use them with the current pre-built models. Formula detection is an add-on feature that is able to recognize slashed fractions clearly though. See the below image from studio where the extraction is successful.

    Enabling the setting from studio:

    User's image

    For slashed fractions.
    User's image

    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.