Running document intelligence to extract Layout but the output given is from a diffeent model (general-document)

Question

Running document intelligence to extract Layout but the output given is from a diffeent model (general-document)

Jacopo Morabito 15

Hi, while using the Python SDK to request the analysis of a PDF document from the Document Intellicence service, we are getting different result between docs tested on document intelligence studio end the results coming back from the SDK call.

More details:
when we test the Layout (modelid="prebilt-layout") model to extract tables in the Azure Document Intelligence Studio we get the tables extracted correctly. From the SDK, the tables are extracted with some missing columns, the weird thing is that the result is the same as if the model used is the General Documents (modelid="prebuilt-document"). Is it possible that the Prebuilt-layout model is not yet supported or that there is some parameter I am missing?

Runtime Python3.11
API version used: tested all of them

Ramr-msft 17,826 Reputation points

2024-01-28T05:07:55.73+00:00
Thanks for the question, Can you please share the pdf document to check. With the Document Intelligence 2023-10-31-preview, the general document model (prebuilt-document) is deprecated. Going forward, to extract key-value pairs from documents, use the prebuilt-layout model with the optional query string parameter features=keyValuePairs enabled.

API Version: Ensure that you’re using the same API version in both the SDK and the Studio. Different versions may have different features or behaviors.

Model ID: Double-check that you’re using the correct model ID (prebuilt-layout) in your SDK calls.

Parameters: Make sure you’re passing the correct parameters in your SDK calls. For example, if you want to extract key-value pairs, you should enable the features=keyValuePairs option.
Aditya R Bhat 5 Reputation points

2024-05-29T09:12:57.0233333+00:00

Same issue happening with me as well. I am trying to extract data from a PDF. If i run latest DI version deployed in East US in DI studio, it is able to extract everything. But if i extract using Python SDK, it is missing a specific paragraph. I am using the prebuilt-layout model.

We are planning to scale the project to extract content from 2000 PDFs. Please tell me how to fix this.

I am using Doc Int API version 2023-10-31-preview

1 answer

Your answer

Ramr-msft 17,826 Reputation points

2024-01-28T05:07:55.73+00:00

Thanks for the question, Can you please share the pdf document to check. With the Document Intelligence 2023-10-31-preview, the general document model (prebuilt-document) is deprecated. Going forward, to extract key-value pairs from documents, use the prebuilt-layout model with the optional query string parameter features=keyValuePairs enabled.

API Version: Ensure that you’re using the same API version in both the SDK and the Studio. Different versions may have different features or behaviors.

Model ID: Double-check that you’re using the correct model ID (prebuilt-layout) in your SDK calls.

Parameters: Make sure you’re passing the correct parameters in your SDK calls. For example, if you want to extract key-value pairs, you should enable the features=keyValuePairs option.
Aditya R Bhat 5 Reputation points

2024-05-29T09:12:57.0233333+00:00

Same issue happening with me as well. I am trying to extract data from a PDF. If i run latest DI version deployed in East US in DI studio, it is able to extract everything. But if i extract using Python SDK, it is missing a specific paragraph. I am using the prebuilt-layout model.

We are planning to scale the project to extract content from 2000 PDFs. Please tell me how to fix this.

I am using Doc Int API version 2023-10-31-preview

Answer 1

Jakov Prpić 0

Encountering a little bit different, but basically the same bug. There is a discrepancy between document intelligence studio output and SDK output on .NET as well. Same API version, same model, same extra features enabled, same image, but confidence score from every single read word is different. I am having values read wrong from SDK and their confidence score in unacceptable levels whereas in document intelligence studio I am having amazing results.

We need an urgent fix for this.

Share via

Running document intelligence to extract Layout but the output given is from a diffeent model (general-document)

1 answer

Your answer