Could you please confirm if any resolution or fix has been implemented for this issue?
Azure Document Intelligence Python SDK Returns Data Only for First Page
Hi - I am encountering an issue while using the azure.ai.documentintelligence Python library to extract data from a PDF using page ranges.
My application processes the document page by page, and this workflow was working correctly until this morning. However, starting today, when I attempt to extract data for specific pages, the service returns results only for the first page, regardless of the page range specified.
There have been no code changes on my side. Has anyone else experienced this issue, or is there a recent service update or known limitation affecting page-range extraction?
Azure AI services
3 answers
Kiran Dhumma 25 Reputation points
2025-12-22T14:43:16.44+00:00 This issue has been resolved in the latest update.
Anshika Varshney 6,195 Reputation points Microsoft External Staff Moderator
2025-12-25T04:41:27.6466667+00:00 Hey Kiran Dhumma,
Thank you for reaching out on Microsoft Q&A.
It looks like you’re running into an issue with the Azure Document Intelligence Python SDK, where it's only processing the first page of your PDF, even when specifying a range of pages. This can be frustrating, especially since you haven't changed your code.
Here are a few things to consider:
- SDK Parameter Check: First, ensure that you are correctly using the pages parameter in your begin_analyze_document call. In the newer SDK versions, it's crucial that the pages parameter is set up correctly. Refer to the documentation for proper usage.
- Subscription Tier: Since you're using version 1.0.2 of the SDK, please confirm that your Azure Document Intelligence resource is not on the F0 (free) tier. The free tier only analyzes the first two pages of a PDF, regardless of the specified page range. If you are on the free tier, upgrading to the S0 (standard) tier will allow full multi-page extraction.
- Check Document Type: Ensure that your document is indeed a PDF. Other formats like DOCX or XLSX may not support the same page-based processing as PDFs do.
- Service Updates or Limitations: If this behavior started recently and there haven't been any code changes on your part, it could be worth checking the Azure updates or forums for any announcements about service changes that might impact the Document Intelligence processing.
- Retry Logic: The retry logic in your code looks good, but it’s also important to check if any rate limiting is coming into play. This can sometimes affect results if repeated calls are made too quickly.
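To make the parameter check concrete, here is a minimal sketch of passing a page range and then confirming which pages actually came back. It assumes version 1.0.x of azure-ai-documentintelligence, where the file is passed as body and the range as a pages string such as "1-3,5", and it assumes the page_number on each returned page matches the page in the input PDF. The helper names (parse_pages, missing_pages, analyze_page_range) and the default prebuilt-layout model are illustrative choices, not part of the SDK or your code.

```python
def parse_pages(spec: str) -> list[int]:
    """Expand a pages string such as "1-3,5" into [1, 2, 3, 5]."""
    numbers = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue
        if "-" in part:
            start, end = part.split("-", 1)
            numbers.update(range(int(start), int(end) + 1))
        else:
            numbers.add(int(part))
    return sorted(numbers)


def missing_pages(requested: str, returned: list[int]) -> list[int]:
    """Pages that were requested but are absent from the service result."""
    return sorted(set(parse_pages(requested)) - set(returned))


def analyze_page_range(endpoint, key, pdf_path, pages, model_id="prebuilt-layout"):
    """Hypothetical wrapper: analyze a page range and return the page numbers found."""
    # Imported here so the helpers above can be used without the SDK installed.
    from azure.ai.documentintelligence import DocumentIntelligenceClient
    from azure.core.credentials import AzureKeyCredential

    client = DocumentIntelligenceClient(
        endpoint=endpoint, credential=AzureKeyCredential(key)
    )
    with open(pdf_path, "rb") as f:
        # Assumption: 1.0.x signature, document stream as `body`, range as `pages`.
        poller = client.begin_analyze_document(model_id, body=f, pages=pages)
        result = poller.result()
    return [page.page_number for page in result.pages]
```

Calling analyze_page_range with your endpoint, key, and a range like "2-4", then passing the returned list to missing_pages, shows whether the service dropped anything. For instance, missing_pages("2-4", [2]) gives [3, 4], which would point to the service (or the tier limit) rather than your page-by-page loop.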
I hope this helps. Do let me know if you have any further queries.
If this answers your query, please do click Accept Answer and Yes for "was this answer helpful". Thank you!