Thanks for reaching out to us. I understand currently you are working on the OCR Read Container and you want to improve the speed.
First, I hope you are using the latest V3.2 container, this will make sure you have the better experience generally.
When you are working on big throughput, as you mentioned, multipage document, a good way to try is leveraging multiple containers on a Kubernetes cluster, using Azure Storage and Azure Queue.
Starting in v3 of the container, you can use the containers in parallel on both a task and page level.
By design, each v3 container has a dispatcher and a recognition worker. The dispatcher is responsible for splitting a multi-page task into multiple single page sub-tasks. The recognition worker is optimized for recognizing a single page document. To achieve page level parallelism, deploy multiple v3 containers behind a load balancer and let the containers share a universal storage and queue.
Please refer to the document for the step by step guidance, I hope this helps!
https://learn.microsoft.com/en-us/azure/cognitive-services/computer-vision/deploy-computer-vision-on-premises#deploy-multiple-v3-containers-on-the-kubernetes-cluster
Regards,
Yutong
-Please kindly accept the answer if you feel helpful, thanks!