Azure document intelligence OCR extraction

Question

Sriramsubramaniyan Nadarajan 76

Hi Team,

We are planning to use the Azure Document Intelligence service for OCR of specific forms. We are using the general document (layout) model and are able to get output, but I have a few queries.

Our requirement is to extract key-value pairs for specific fields, along with their confidence levels.

I can see in the sample code that these are all extracted through different components; however, we are unable to relate them to one another.

For example, key-value pairs are fetched from result.key_value_pairs, whereas words and tables are read in separate loops:

for page in result.pages, for word in page.words, for table_idx, table in enumerate(result.tables)

Can you please provide some suggestions or sample code so that our output looks like the following?

Key - Name

Value - Microsoft

Confidence - 100

If any checkboxes are present, the output should look like this:

Key - Name

Value - Microsoft

Confidence - 100

Selection mark - selected

We are using the Python SDK; any suggestions would be very helpful. Thanks.

Accepted answer

Answer 1

VasaviLankipalle-MSFT 18,676 Moderator

Hello @Sriramsubramaniyan Nadarajan , Thanks for using Microsoft Q&A Platform.

Here is the Python SDK sample code for the general document model. You can use the code snippet below to print the key-value pairs along with their confidence scores, as per your requirement. The full sample code is available on GitHub here: https://github.com/Azure/azure-sdk-for-python/tree/azure-ai-formrecognizer_3.3.0/sdk/formrecognizer/azure-ai-formrecognizer/samples

# `result` is the analysis result returned by poller.result().
# format_bounding_region() is a helper from the linked samples, not part of the SDK itself.
print("----Key-value pairs found in document----")
for kv_pair in result.key_value_pairs:
    if kv_pair.key:
        print(
            "Key '{}' found within '{}' bounding regions".format(
                kv_pair.key.content,
                format_bounding_region(kv_pair.key.bounding_regions),
            )
        )
    if kv_pair.value:
        print(
            "Value '{}' found within '{}' bounding regions".format(
                kv_pair.value.content,
                format_bounding_region(kv_pair.value.bounding_regions),
            )
        )
    # confidence is a fraction between 0 and 1; scale it to a percentage.
    print("confidence score: {:.1f}%\n".format(kv_pair.confidence * 100))
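To get closer to the exact layout asked for in the question (Key / Value / Confidence / Selection mark), a minimal sketch along the following lines may help. It is not taken from the SDK samples. The function name summarize_key_value_pairs is hypothetical, and the code assumes that a checkbox value shows up in kv_pair.value.content as the text marker ":selected:" or ":unselected:", which is worth verifying against your own documents. Stand-in objects are used in place of a real result so the logic can be tried without a service call.

```python
from types import SimpleNamespace

# Assumed text markers for checkbox state inside a value's content.
SELECTED = ":selected:"
UNSELECTED = ":unselected:"


def summarize_key_value_pairs(key_value_pairs):
    """Flatten key-value pairs into dicts: key, value, confidence (%), selection mark."""
    rows = []
    for kv_pair in key_value_pairs:
        if not kv_pair.key:
            continue  # skip entries that have no key text
        value = kv_pair.value.content if kv_pair.value else ""
        selection_mark = None
        if UNSELECTED in value:
            selection_mark = "unselected"
        elif SELECTED in value:
            selection_mark = "selected"
        # Remove the marker so only the readable value text remains.
        value = value.replace(UNSELECTED, "").replace(SELECTED, "").strip()
        rows.append(
            {
                "key": kv_pair.key.content,
                "value": value,
                "confidence": round(kv_pair.confidence * 100, 1),
                "selection_mark": selection_mark,
            }
        )
    return rows


# Stand-in data shaped like result.key_value_pairs (hypothetical values).
sample = [
    SimpleNamespace(
        key=SimpleNamespace(content="Name"),
        value=SimpleNamespace(content="Microsoft"),
        confidence=0.98,
    ),
    SimpleNamespace(
        key=SimpleNamespace(content="Subscribed"),
        value=SimpleNamespace(content=":selected:"),
        confidence=0.91,
    ),
]

for row in summarize_key_value_pairs(sample):
    print("Key -", row["key"])
    print("Value -", row["value"])
    print("Confidence -", row["confidence"])
    if row["selection_mark"]:
        print("Selection mark -", row["selection_mark"])
    print()
```

With a real analysis you would pass result.key_value_pairs instead of the sample list.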

I hope this helps.

Regards,
Vasavi

Please accept the answer and vote 'Yes' if you found it helpful, to support the community. Thanks.

Sriramsubramaniyan Nadarajan 76 Reputation points

2023-09-29T15:43:38.59+00:00

Hi @VasaviLankipalle-MSFT ,

Thanks for your help. We are now able to get the required key-value pairs and confidence scores.

I have one more query. I can see an option to specify the pages to analyze in Form Recognizer Studio. Can you please let me know whether a similar option is available when analyzing a document through the SDK or a REST call?

For example, if a document has 10 pages and I need to analyze only the first two, is it possible to specify the page range or page numbers in the SDK / REST call?

Alternatively, is it possible to define a keyword for the pages to analyze? For example, if my PO, containing the keyword "purchase order", is on the first two pages of a 10-page document, can Form Recognizer automatically analyze only the pages that contain that keyword? Please advise. Thanks.

VasaviLankipalle-MSFT 18,676 Reputation points Moderator

2023-09-29T16:50:29.2433333+00:00

Hello @Sriramsubramaniyan Nadarajan , Glad to know it helped you.

Yes, you can specify the page range or page number using SDK or REST API.

To specify the page range or page numbers, you can use the pages parameter of the begin_recognize_content_from_url method in the Form Recognizer client library. Here is the SDK documentation:

pages: Custom page numbers for multi-page documents (PDF/TIFF). Input the page numbers and/or ranges of pages you want to get in the result. For a range of pages, use a hyphen, like pages=["1-3", "5-6"]. Separate each page number or range with a comma.

https://learn.microsoft.com/en-us/python/api/azure-ai-formrecognizer/azure.ai.formrecognizer.formrecognizerclient?view=azure-python

You can also specify the pages parameter in the REST API:

https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2023-07-31/operations/AnalyzeDocument
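The key-value code earlier in this thread reads result.key_value_pairs, which comes from DocumentAnalysisClient rather than FormRecognizerClient. As far as I know, begin_analyze_document on that client also accepts a pages keyword, but as a single comma-separated string (for example "1-2") instead of a list. The sketch below works under that assumption. build_pages_argument and analyze_selected_pages are hypothetical helper names, and the endpoint, key, and file path are placeholders you would supply.

```python
def build_pages_argument(ranges):
    """Turn items such as [(1, 3), 5] into the string "1-3,5"."""
    parts = []
    for item in ranges:
        if isinstance(item, tuple):
            start, end = item
            parts.append(f"{start}-{end}")
        else:
            parts.append(str(item))
    return ",".join(parts)


def analyze_selected_pages(endpoint, key, path, ranges):
    """Analyze only the requested pages with the general document model."""
    # Imported lazily so build_pages_argument can be used without the SDK installed.
    from azure.ai.formrecognizer import DocumentAnalysisClient
    from azure.core.credentials import AzureKeyCredential

    client = DocumentAnalysisClient(endpoint, AzureKeyCredential(key))
    with open(path, "rb") as document:
        poller = client.begin_analyze_document(
            "prebuilt-document",
            document=document,
            pages=build_pages_argument(ranges),  # assumed to be a string such as "1-2"
        )
        return poller.result()


# First two pages of a longer document:
print(build_pages_argument([(1, 2)]))
```

Calling analyze_selected_pages(endpoint, key, "po.pdf", [(1, 2)]) would then request only pages 1 and 2; the returned result can be passed to the key-value loop shown in the answer.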

I have converted my previous comment to an answer. Please take a moment to accept it if you found it helpful.

Sriramsubramaniyan Nadarajan 76 Reputation points

2023-09-29T18:07:37.96+00:00

Hi @VasaviLankipalle-MSFT , Thanks for your suggestions. I have accepted the answer and will open a new thread in case of any further queries. Thanks.
