Hello @Thibault Verlinde , Thanks for using Microsoft Q&A Platform.
Yes, this is the expected behavior. In the Word document, up to 3,000 characters is considered one page unit. Additionally, there is no bounding polygon or bounding region information for each detected object, and page range (pages) is not supported as a parameter.
Versions
2024-02-29-preview
,2023-10-31-preview
, and later support Microsoft office (DOCX, XLSX, PPTX) and HTML files. The following features are not supported:
- There are no angle, width/height and unit with each page object.
- For each object detected, there is no bounding polygon or bounding region.
- Page range (
pages
) is not supported as a parameter.- No
lines
object.
From the documentation: https://learn.microsoft.com/en-us/azure/ai-services/document-intelligence/concept-read?view=doc-intel-4.0.0#pages
I hope this helps.
Regards,
Vasavi
-Please kindly accept the answer and vote 'yes' if you feel helpful to support the community, thanks.