OCR ignores numbers in the corner of the document

Vlad R 20 Reputation points
2023-03-23T15:43:01.68+00:00

Hello! I am using the Read model to extract text information from receipts. It seems to ignore numbers that are located in the bottom corner of a document. I am attaching three examples where the Read model has not read the 000 code at the bottom of the receipt.

I tried adding another number to the left of the code manually (to make seem it like "5000", or "1000", etc.), but it still just ignores the code completely.

Can anything be done about that by my side (e.g. image preprocessing, rotating the image, etc.?)

test5.jpg
test4.jpg
test2.jpg

Azure Computer Vision
Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
316 questions
Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,405 questions
{count} votes

Accepted answer
  1. VasaviLankipalle-MSFT 14,576 Reputation points
    2023-03-29T18:19:47.5033333+00:00

    Hi @Vlad R , Thank you for your patience.

    The current system has a limitation in text detection and rejection. So even if the text contains characters from unknown languages, it can still be detected. However, the text recognizer cannot recognize these unknown characters, resulting in the entire text line being rejected. This can lead to numbers/Latin characters being detected along with the unknown text, causing the recognizer to fail to recognize the unknown characters, and ultimately deleting the entire line, including the recognized characters.

    After checking with the PG team, noticed that this is a known issue. Sorry for the inconveniences.
    Maybe you can try custom models and see if that helps.

    I hope this helps. Let me know if you need more information.

    Regards,
    Vasavi

    -Please kindly accept the answer and vote 'Yes' if you feel helpful to support the community, thanks.

    1 person found this answer helpful.
    0 comments No comments

0 additional answers

Sort by: Most helpful