Running document intelligence to extract Layout but the output given is from a diffeent model (general-document)

Jacopo Morabito 10 Reputation points
2024-01-26T09:16:23.9733333+00:00

Hi, while using the Python SDK to request the analysis of a PDF document from the Document Intellicence service, we are getting different result between docs tested on document intelligence studio end the results coming back from the SDK call.

More details:
when we test the Layout (modelid="prebilt-layout") model to extract tables in the Azure Document Intelligence Studio we get the tables extracted correctly. From the SDK, the tables are extracted with some missing columns, the weird thing is that the result is the same as if the model used is the General Documents (modelid="prebuilt-document"). Is it possible that the Prebuilt-layout model is not yet supported or that there is some parameter I am missing?

Runtime Python3.11
API version used: tested all of them

Azure AI Document Intelligence
Azure AI Document Intelligence
An Azure service that turns documents into usable data. Previously known as Azure Form Recognizer.
1,541 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Jakov Prpić 0 Reputation points
    2024-07-23T07:40:08.03+00:00

    Encountering a little bit different, but basically the same bug. There is a discrepancy between document intelligence studio output and SDK output on .NET as well. Same API version, same model, same extra features enabled, same image, but confidence score from every single read word is different. I am having values read wrong from SDK and their confidence score in unacceptable levels whereas in document intelligence studio I am having amazing results.

    We need an urgent fix for this.

    0 comments No comments