How to get character coordinates in Computer Vision API?

Zhivko-2008 20 Reputation points
2023-01-27T00:03:02.6033333+00:00

The Read (OCR) model response from Computer Vision / Form Recognizer contains bounding boxes for words, but no character or glyph positions. Is there any way to get the positions of individual characters?

Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.

Accepted answer
  1. romungi-MSFT 42,286 Reputation points Microsoft Employee
    2023-01-30T14:03:41.3733333+00:00

    Zhivko, the OCR and Read APIs can provide coordinates only for each word: every line in the response lists its extracted words together with their coordinates and confidence scores. Currently, there is no setting that returns individual character positions in the response.

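Since the service returns only word-level boxes, one possible workaround is to estimate character positions on the client side. The sketch below is not an API feature: it simply splits a word's box into equal-width slices, one per character. It assumes the word box arrives as eight numbers (x1, y1, ..., x4, y4) ordered top-left, top-right, bottom-right, bottom-left, and `approximate_char_boxes` is a hypothetical helper name. Equal widths are only a rough guess for proportional fonts, where "i" and "W" differ a lot.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def approximate_char_boxes(text: str, box: List[float]) -> List[Tuple[str, List[Point]]]:
    """Estimate one quadrilateral per character by slicing the word box evenly.

    Assumption: `box` is [x1, y1, x2, y2, x3, y3, x4, y4] in the order
    top-left, top-right, bottom-right, bottom-left.
    """
    if len(box) != 8:
        raise ValueError("expected 8 numbers: x1, y1, ..., x4, y4")
    if not text:
        return []

    # Unpack the four corners of the word box.
    tl, tr, br, bl = [(box[i], box[i + 1]) for i in range(0, 8, 2)]

    def lerp(a: Point, b: Point, t: float) -> Point:
        # Linear interpolation between points a and b.
        return (a[0] + (b[0] - a[0]) * t, a[1] + (b[1] - a[1]) * t)

    n = len(text)
    result = []
    for i, ch in enumerate(text):
        t0, t1 = i / n, (i + 1) / n
        # Walk along the top and bottom edges so rotated words are handled too.
        quad = [lerp(tl, tr, t0), lerp(tl, tr, t1), lerp(bl, br, t1), lerp(bl, br, t0)]
        result.append((ch, quad))
    return result
```

Calling `approximate_char_boxes(word_text, word_box)` for each word in the response gives a list of `(character, quad)` pairs. If exact glyph positions matter, this estimate will not be enough, and a different OCR engine that reports character boxes would be needed.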

