Hi Bryan Bates,
Welcome to Microsoft Q&A and Thank you for reaching out.
I understand your experiencing long delays sometimes 20 to 30 minutes or more when using the Document Intelligence Read model, even though it used to return results within a couple of seconds, there are several common reasons this might be happening.
The size and complexity of the documents themselves play a big role. Larger files with more pages naturally take longer to process. Documents that include scanned images, handwriting, tables, or high-resolution pictures require more time and computing power to analyze. Even small changes like higher image quality or additional embedded content can significantly increase processing time.
Document Intelligence is a multitenant, asynchronous microservice. This means it shares resources across many users and processes requests asynchronously. Because of this architecture, latency can vary even for similar documents and there is no guaranteed SLA for latency and processing times can fluctuate depending on backend capacity, operational load, or maintenance activities.
API versions (like 2024-11-30) and the use of custom extraction models have been reported to cause increased processing times or resource exhaustion in some regions. Custom models usually require more processing than prebuilt ones and may be more sensitive to backend resource availability or recent software updates.
Your current service tier (S0) provides good baseline capacity but has limitations compared to higher tiers or dedicated resources. If there is increased demand in the Canada Central region or if you hit quota limits, your requests may be queued or throttled, leading to longer wait times. Additionally, regional outages or partial service degradations can cause delays.
Sometimes network issues or the size of the documents being uploaded and downloaded can add to delays. Poorly formatted or corrupted documents may also require extra time for the system to process, causing longer latency than usual.
To troubleshoot, start by comparing documents you process now versus earlier ones to check for size or complexity differences. Monitor Azure service health for Canada Central to see if there are ongoing incidents. Review your resource usage metrics for throttling or error spikes and ensure you aren’t hitting quota limits. You might also want to try processing documents with prebuilt models or different API versions to compare performance.
Consider breaking large documents into smaller batches or scheduling processing during off-peak hours to reduce load. Implement retry and timeout logic in your workflow to gracefully handle occasional long-running requests without stalling the entire process.
For your References:
Troubleshooting Issues
Azure Service Health Overview
I Hope this helps. Do let me know if you have any further queries.
Thank you!