Which model should I choose?

Important

  • Document Intelligence public preview releases provide early access to features that are in active development.
  • Features, approaches, and processes may change, prior to General Availability (GA), based on user feedback.
  • The public preview version of Document Intelligence client libraries default to REST API version 2024-02-29-preview.
  • Public preview version 2024-02-29-preview is currently only available in the following Azure regions:
  • East US
  • West US2
  • West Europe

This content applies to:checkmark v4.0 (preview) | Previous versions: blue-checkmark v3.1 (GA) blue-checkmark v3.0 (GA)

This content applies to: checkmark v3.1 (GA) | Latest version: purple-checkmark v4.0 (preview) | Previous versions: blue-checkmark v3.0

This content applies to: checkmark v3.0 (GA) | Latest versions: purple-checkmark v4.0 (preview) purple-checkmark v3.1

Azure AI Document Intelligence supports a wide variety of models that enable you to add intelligent document processing to your applications and optimize your workflows. Selecting the right model is essential to ensure the success of your enterprise. In this article, we explore the available Document Intelligence models and provide guidance for how to choose the best solution for your projects.

The following decision charts highlight the features of each Document Intelligence v3.0 supported model and help you choose the best model to meet the needs and requirements of your application.

Important

Be sure to check the language support page for supported language text and field extraction by feature.

Pretrained document-analysis models

Document type Example Data to extract Your best solution
A generic document. A contract or letter. You want to primarily extract written or printed text lines, words, locations, and detected languages. Read OCR model
A document that includes structural information. A report or study. In addition to written or printed text, you need to extract structural information like tables, selection marks, paragraphs, titles, headings, and subheadings. Layout analysis model
A structured or semi-structured document that includes content formatted as fields (keys) and values. A form or document that is a standardized format commonly used in your business or industry like a credit application or survey. You want to extract fields and values including ones not covered by the scenario-specific prebuilt models without having to train a custom model. **Layout analysis model with the optional query string parameter features=keyValuePairs enabled **

Pretrained scenario-specific models

Document type Data to extract Your best solution
US W-2 tax form You want to extract key information such as salary, wages, and taxes withheld. US tax W-2 model
US Tax 1098 form You want to extract mortgage interest details such as principal, points, and tax. US tax 1098 model
US Tax 1098-E form You want to extract student loan interest details such as lender and interest amount. US tax 1098-E model
US Tax 1098T form You want to extract qualified tuition details such as scholarship adjustments, student status, and lender information. US tax 1098-T model
US Tax 1099(Variations) form You want to extract information from 1099 forms and its variations (A, B, C, CAP, DIV, G, H, INT, K, LS, LTC, MISC, NEC, OID, PATR, Q, QA, R, S, SA, SB). US tax 1099 model
US Tax 1040(Variations) form You want to extract information from 1040 forms and its variations (Schedule 1, Schedule 2, Schedule 3, Schedule 8812, Schedule A, Schedule B, Schedule C, Schedule D, Schedule E, Schedule EIC, Schedule F, Schedule H, Schedule J, Schedule R, Schedule SE, Schedule Senior). US tax 1040 model
Contract (legal agreement between parties). You want to extract contract agreement details such as parties, dates, and intervals. Contract model
Health insurance card or health insurance ID. You want to extract key information such as insurer, member ID, prescription coverage, and group number. Health insurance card model
Credit/Debit card . You want to extract key information bank cards such as card number and bank name. Credit/Debit card model
Marriage Certificate . You want to extract key information from marriage certificates. Marriage certificate model
Invoice or billing statement. You want to extract key information such as customer name, billing address, and amount due. Invoice model
Receipt, voucher, or single-page hotel receipt. You want to extract key information such as merchant name, transaction date, and transaction total. Receipt model
Identity document (ID) like a U.S. driver's license or international passport. You want to extract key information such as first name, surname, date of birth, address, and signature. Identity document (ID) model
US Mortgage 1003 . You want to extract key information from the Uniform Residential loan application. 1003 form model
US Mortgage 1008 . You want to extract key information from the Uniform Underwriting and Transmittal summary. 1008 form model
US Mortgage Closing Disclosure . You want to extract key information from a mortgage closing disclosure form. Mortgage closing disclosure form model
Mixed-type document(s) with structured, semi-structured, and/or unstructured elements. You want to extract key-value pairs, selection marks, tables, signature fields, and selected regions not extracted by prebuilt or general document models. Custom model

Tip

  • If you're still unsure which pretrained model to use, try the layout model with the optional query string parameter features=keyValuePairs enabled.
  • The layout model is powered by the Read OCR engine to detect pages, tables, styles, text, lines, words, locations, and languages.

Custom extraction models

Training set Example documents Your best solution
Structured, consistent, documents with a static layout. Structured forms such as questionnaires or applications. Custom template model
Structured, semi-structured, and unstructured documents. ● Structured → surveys
● Semi-structured → invoices
● Unstructured → letters
Custom neural model
A collection of several models each trained on similar-type documents. ● Supply purchase orders
● Equipment purchase orders
● Furniture purchase orders
All composed into a single model.
Composed custom model

Custom classification model

Training set Example documents Your best solution
At least two different types of documents. Forms, letters, or documents Custom classification model

Next steps