Which model should I choose?

Important

  • Document Intelligence public preview releases provide early access to features that are in active development.
  • Features, approaches, and processes may change, prior to General Availability (GA), based on user feedback.
  • The public preview version of Document Intelligence client libraries default to REST API version 2023-10-31-preview.

This content applies to:checkmark v4.0 (preview) | Previous versions: blue-checkmark v3.1 (GA) blue-checkmark v3.0 (GA)

This content applies to: checkmark v3.1 (GA) | Latest version: purple-checkmark v4.0 (preview) | Previous versions: blue-checkmark v3.0

This content applies to: checkmark v3.0 (GA) | Latest versions: purple-checkmark v4.0 (preview) purple-checkmark v3.1 (preview)

Azure AI Document Intelligence supports a wide variety of models that enable you to add intelligent document processing to your applications and optimize your workflows. Selecting the right model is essential to ensure the success of your enterprise. In this article, we explore the available Document Intelligence models and provide guidance for how to choose the best solution for your projects.

The following decision charts highlight the features of each Document Intelligence v3.0 supported model and help you choose the best model to meet the needs and requirements of your application.

Important

Be sure to check the language support page for supported language text and field extraction by feature.

Pretrained document-analysis models

Document type Example Data to extract Your best solution
A generic document. A contract or letter. You want to primarily extract written or printed text lines, words, locations, and detected languages. Read OCR model
A document that includes structural information. A report or study. In addition to written or printed text, you need to extract structural information like tables, selection marks, paragraphs, titles, headings, and subheadings. Layout analysis model
A structured or semi-structured document that includes content formatted as fields (keys) and values. A form or document that is a standardized format commonly used in your business or industry like a credit application or survey. You want to extract fields and values including ones not covered by the scenario-specific prebuilt models without having to train a custom model. **Layout analysis model with the optional query string parameter features=keyValuePairs enabled **

Pretrained scenario-specific models

Document type Data to extract Your best solution
US W-2 tax form You want to extract key information such as salary, wages, and taxes withheld. US tax W-2 model
US Tax 1098 form You want to extract mortgage interest details such as principal, points, and tax. US tax 1098 model
US Tax 1098-E form You want to extract student loan interest details such as lender and interest amount. US tax 1098-E model
US Tax 1098T form You want to extract qualified tuition details such as scholarship adjustments, student status, and lender information. US tax 1098-T model
Contract (legal agreement between parties). You want to extract contract agreement details such as parties, dates, and intervals. Contract model
Health insurance card or health insurance ID. You want to extract key information such as insurer, member ID, prescription coverage, and group number. Health insurance card model
Invoice or billing statement. You want to extract key information such as customer name, billing address, and amount due. Invoice model
Receipt, voucher, or single-page hotel receipt. You want to extract key information such as merchant name, transaction date, and transaction total. Receipt model
Identity document (ID) like a U.S. driver's license or international passport. You want to extract key information such as first name, last name, date of birth, address, and signature. Identity document (ID) model
Mixed-type document(s) with structured, semi-structured, and/or unstructured elements. You want to extract key-value pairs, selection marks, tables, signature fields, and selected regions not extracted by prebuilt or general document models. Custom model

Tip

  • If you're still unsure which pretrained model to use, try the layout model with the optional query string parameter features=keyValuePairs enabled.
  • The layout model is powered by the Read OCR engine to detect pages, tables, styles, text, lines, words, locations, and languages.

Custom extraction models

Training set Example documents Your best solution
Structured, consistent, documents with a static layout. Structured forms such as questionnaires or applications. Custom template model
Structured, semi-structured, and unstructured documents. ● Structured → surveys
● Semi-structured → invoices
● Unstructured → letters
Custom neural model
A collection of several models each trained on similar-type documents. ● Supply purchase orders
● Equipment purchase orders
● Furniture purchase orders
All composed into a single model.
Composed custom model

Next steps