An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
Hi,
this is very unlikely to be caused simply by different numbers of columns per row. Html tables with uneven rows are valid and the layout model can usually handle them, especially when colspan and rowspan are used correctly.
whats stands out in ur snippet is the absolute positioning like left and bottom in inline styles. That pattern is typical of pdf to html exports. In such cases the table structure is often only visual, while the real layout is driven by absolutely positioned div elements. The service then tries to reconstruct a logical reading order and table grid from something that is essentially a positioned canvas, and this can trigger an invalid request instead of a clean parsing error. Common cause is very complex or inconsistent dom structure. Empty cells with nbsp only content, nested divs inside td elements, mixed spans and positioning can produce a grid that cannot be resolved into a consistent internal table model. The layout analyzer expects a coherent structure, not a rendering hack.
The request is truly sending valid html with the correct content type such as text html, and that the file is properly encoded in utf 8. If this html was generated from a pdf, u will almost certainly get more stable results by sending the original pdf to the layout model instead of the converted html.
To narrow it down, try validating the html strictly, then test a simplified version of the failing page without absolute positioning styles. If that works, the issue is structural rather than column count. It is also worth calling the rest api directly and capturing the full raw response body, as studio often hides additional error detail.
In practice ))) thats looks much more like a parsing limitation with pdf style html than a problem with uneven table columns.
rgds,
Alex