Document Intelligence model prebuilt-check.us - Incorrect behaviour on WordAmount field/attribute

Sinha, Rahul 1 0 Reputation points
2026-04-06T15:47:39.0633333+00:00

Hi,

We are using the Azure Document Intelligence prebuilt-check.us model to extract data from US checks via OCR.

In the use case below, the content of WordAmount is read correctly from the check as "Three Dollar's", but valueNumber is returned as 510, which is the same as the NumberAmount value.

Could you please advise how valueNumber is derived as 510 in this case? Is the model simply copying the NumberAmount value into the WordAmount valueNumber?

    "WordAmount": {
      "type": "number",
      "valueNumber": 510.0,
      "content": "THREE DOLLAR'S ONLY\nDollars",
      "boundingRegions": [
        {
          "pageNumber": 1,
          "polygon": [69.0, 261.0, 988.0, 268.0, 988.0, 311.0, 69.0, 303.0]
        }
      ],
      "spans": [
        {
          "offset": 159,
          "length": 27
        }
      ],
      "confidence": 0.378
    },
    "NumberAmount": {
      "type": "number",
      "valueNumber": 510.0,
      "content": "$510.00",

Azure Document Intelligence in Foundry Tools

2 answers

  1. Samtayar Leehar 0 Reputation points
    2026-04-19T19:41:55.4466667+00:00

    Interesting issue. It looks like the model is trying to infer the numeric value from surrounding fields instead of relying only on the written text, which can definitely cause problems in financial workflows. Using confidence scores and separate validation logic sounds like the safest workaround for now. For any system where calculations must be precise, I usually recommend an independent verification step with a simple GWA calculator or similar tool logic to cross-check outputs before final processing.


  2. Ghulam Muhayyu Din 0 Reputation points
    2026-04-19T17:30:10.47+00:00

    Hello Rahul,

    It is actually expected behavior for the Document Intelligence prebuilt-check.us model to normalize numeric extractions.

    While the content attribute correctly captures the verbatim text string (e.g., "THREE DOLLAR'S ONLY"), the valueNumber field is designed to parse and return the numerical equivalent (e.g., 510.0) based on the overall context, often cross-referencing the NumberAmount field if the model's heuristics determine they represent the same financial entity. For precise text extraction without numerical conversion, you should adjust your application logic to exclusively pull from the content string or the spans array, rather than relying on the valueNumber attribute.
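
Both answers point toward validating the two amount fields in your own application code. A minimal Python sketch of such a cross-check follows. It is an illustration under stated assumptions, not part of the Document Intelligence SDK: the helper names `words_to_number` and `review_check_amounts` are hypothetical, the `fields` dictionary is assumed to have the same shape as the JSON in the question, the `0.8` confidence threshold is an arbitrary example value, and the parser only handles whole-dollar English number words (cents such as "50/100" are ignored, so only whole dollars are compared).

```python
import re

UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}
SCALES = {"thousand": 1_000, "million": 1_000_000}


def words_to_number(text):
    """Parse whole-dollar English number words; return None if none are found."""
    total = current = 0
    seen = False
    for token in re.findall(r"[a-z]+", text.lower()):
        if token in UNITS:
            current += UNITS[token]
        elif token in TENS:
            current += TENS[token]
        elif token == "hundred":
            current = max(current, 1) * 100
        elif token in SCALES:
            total += max(current, 1) * SCALES[token]
            current = 0
        else:
            continue  # skip non-number words such as "dollars" or "only"
        seen = True
    return float(total + current) if seen else None


def review_check_amounts(fields, min_confidence=0.8):
    """Return reasons the check needs manual review (empty list if none).

    Assumes `fields` mirrors the JSON in the question; `min_confidence`
    is an example threshold, not a documented recommendation.
    """
    word = fields.get("WordAmount", {})
    number = fields.get("NumberAmount", {})
    issues = []
    if word.get("confidence", 0.0) < min_confidence:
        issues.append("low WordAmount confidence")
    # Re-derive the written amount from `content` instead of trusting valueNumber.
    parsed = words_to_number(word.get("content", ""))
    numeric = number.get("valueNumber")
    if parsed is None or numeric is None:
        issues.append("amount missing or unparseable")
    elif int(parsed) != int(numeric):  # compare whole dollars only
        issues.append(f"written amount {parsed} != numeric amount {numeric}")
    return issues


# Values copied from the question.
fields = {
    "WordAmount": {
        "valueNumber": 510.0,
        "content": "THREE DOLLAR'S ONLY\nDollars",
        "confidence": 0.378,
    },
    "NumberAmount": {"valueNumber": 510.0, "content": "$510.00"},
}
issues = review_check_amounts(fields)
print(issues)
```

With the values from the question, the sketch flags the check twice: once for the low WordAmount confidence (0.378) and once because the written amount parses to 3 while NumberAmount is 510. A check flagged this way could be routed to manual review instead of being processed automatically.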
