How to get character coordinates in Computer Vision API?

Zhivko-2008 20 Reputation points
2023-01-27T00:03:02.6033333+00:00

The response from the Computer Vision / Form Recognizer Read (OCR) model contains bounding boxes for words, but no character or glyph positions. Is there any way to get the positions of individual characters?

Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

Accepted answer
  1. romungi-MSFT 48,526 Reputation points Microsoft Employee
    2023-01-30T14:03:41.3733333+00:00

    @Zhivko, the OCR and Read APIs return coordinates only down to the word level: each extracted line lists its words, and each word carries its own coordinates and confidence score. Currently, there is no setting to get individual character positions in the response.

    Please accept the answer if it was helpful, to support the community. Thanks.
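
Because the response stops at word level, one rough workaround is to estimate character boxes yourself by slicing each word's bounding box into equal-width pieces. The sketch below is only an approximation and is not something the API provides: it assumes every character in a word has the same width (untrue for proportional fonts), and it assumes each word entry has a `text` field and a `boundingBox` of 8 numbers ordered top-left, top-right, bottom-right, bottom-left. The `estimate_char_boxes` helper and the sample `word` are hypothetical, for illustration only.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def lerp(a: Point, b: Point, t: float) -> Point:
    """Linearly interpolate between points a and b (t in [0, 1])."""
    return (a[0] + (b[0] - a[0]) * t, a[1] + (b[1] - a[1]) * t)


def estimate_char_boxes(
    text: str, bounding_box: List[float]
) -> List[Tuple[str, List[Point]]]:
    """Split a word quadrilateral into equal-width slices, one per character.

    ASSUMPTION: bounding_box holds 8 numbers, i.e. x,y pairs for the
    top-left, top-right, bottom-right and bottom-left corners.
    The result is an estimate, not real glyph positions.
    """
    if len(bounding_box) != 8:
        raise ValueError("expected 8 numbers (4 corner points)")

    # Group the flat list into four (x, y) corner points.
    tl, tr, br, bl = [
        (bounding_box[i], bounding_box[i + 1]) for i in range(0, 8, 2)
    ]

    n = len(text)
    boxes = []
    for i, ch in enumerate(text):
        t0, t1 = i / n, (i + 1) / n
        # Interpolate along the top and bottom edges so rotated
        # (non-axis-aligned) words are sliced along their own baseline.
        quad = [
            lerp(tl, tr, t0),  # top-left of this character
            lerp(tl, tr, t1),  # top-right
            lerp(bl, br, t1),  # bottom-right
            lerp(bl, br, t0),  # bottom-left
        ]
        boxes.append((ch, quad))
    return boxes


# Hypothetical word entry shaped like a word-level OCR result.
word = {
    "text": "Azure",
    "boundingBox": [10, 20, 60, 20, 60, 40, 10, 40],
    "confidence": 0.99,
}
char_boxes = estimate_char_boxes(word["text"], word["boundingBox"])
```

Each element of `char_boxes` pairs a character with its estimated four corner points. If you need accurate glyph positions, this estimate will not be enough, especially for proportional fonts or handwriting.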

