Huge latency added to Document OCR requests

Ahmad Hammoudeh 0 Reputation points
2024-05-12T15:31:24.0633333+00:00

I'm seeing high latency with the Document Intelligence service while running OCR on a large number of images of books. Each image takes about 5 seconds to return a result, which I believe is not solely due to the OCR step: the request latency metric on the Azure resource shows only around 100 ms for my requests. I have ruled out the time needed to send the image itself, because the file is small (image info is listed below). That leaves factors such as the asynchronous nature of the request, how the Azure server handles the request, and the tier of the service.
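To narrow down where the 5 seconds go, the submit step and the wait-for-result step of an asynchronous request can be timed separately. The following is only a minimal sketch of that idea: `time_async_request`, `fake_submit`, and `fake_wait` are hypothetical names, and the service is simulated with `time.sleep` rather than called for real.

```python
import time
from typing import Any, Callable, Dict, Tuple


def time_async_request(
    submit: Callable[[], Any],
    wait: Callable[[Any], Any],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Any, Dict[str, float]]:
    """Time the submit phase and the wait-for-result phase separately."""
    start = clock()
    handle = submit()        # e.g. returns a poller / operation handle
    submitted = clock()
    result = wait(handle)    # e.g. blocks until the operation completes
    finished = clock()
    timings = {
        "submit_s": submitted - start,
        "wait_s": finished - submitted,
        "total_s": finished - start,
    }
    return result, timings


if __name__ == "__main__":
    # Simulated service (assumption): quick submit, slower completion.
    def fake_submit() -> str:
        time.sleep(0.01)
        return "operation-handle"

    def fake_wait(handle: str) -> str:
        time.sleep(0.05)
        return f"result for {handle}"

    result, timings = time_async_request(fake_submit, fake_wait)
    print(result, timings)
```

With the SDK, `submit` would presumably wrap the `begin_analyze_document` call and `wait` would call `result()` on the returned poller. If most of the time lands in `wait_s`, the delay is in waiting for the operation to complete rather than in the initial request.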

Regarding the issue, I have several questions:

  • What could be the real reason behind this latency? Is there something I have missed?
  • Given the configuration and image info below, is this latency normal?
  • I am considering using Azure Functions with Azure Storage to process the large number of documents I have. Should I expect the same response time for each image?
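For the last question, the distinction I have in mind is between per-image latency and overall throughput: if requests can run concurrently, the total time for a batch may drop even when each image still takes about 5 seconds. The sketch below illustrates this with a thread pool. It is an assumption-laden example, not a tested pipeline: `process_all` and `fake_analyze` are hypothetical names, the service call is simulated with `time.sleep`, and the free tier may throttle concurrent requests.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def process_all(
    items: Iterable[T],
    analyze_one: Callable[[T], R],
    max_workers: int = 4,
) -> List[R]:
    """Run analyze_one over items concurrently, preserving input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in the same order as the inputs.
        return list(pool.map(analyze_one, items))


if __name__ == "__main__":
    # Simulated per-image call (assumption): fixed delay per request.
    def fake_analyze(name: str) -> str:
        time.sleep(0.05)
        return f"text of {name}"

    pages = [f"page_{i}.tiff" for i in range(8)]
    started = time.perf_counter()
    texts = process_all(pages, fake_analyze, max_workers=4)
    elapsed = time.perf_counter() - started
    print(len(texts), "pages in", round(elapsed, 2), "s")
```

Whether the real service allows this level of concurrency depends on the tier's rate limits, which I have not verified.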

Configurations:

  • Free Tier F0, Location East US, Version 4.0
  • SDK: Python, version 1.0.0b2
  • Model type: Read
  • Environment: Google Colab, with the server located in the US
  • The request was sent by following the quickstart tutorial

Image info:

  • Link to the image
  • File Type: TIFF
  • File Size: 173 KB
  • Dimensions: 2339 x 1654

Thanks for your time.

Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.