Which model should I choose?
Important
- Document Intelligence public preview releases provide early access to features that are in active development.
- Features, approaches, and processes may change, prior to General Availability (GA), based on user feedback.
- The public preview version of Document Intelligence client libraries default to REST API version 2023-10-31-preview.
This content applies to: v4.0 (preview) | Previous versions:
v3.1 (GA)
v3.0 (GA)
This content applies to: v3.1 (GA) | Latest version:
v4.0 (preview) | Previous versions:
v3.0
This content applies to: v3.0 (GA) | Latest versions:
v4.0 (preview)
v3.1 (preview)
Azure AI Document Intelligence supports a wide variety of models that enable you to add intelligent document processing to your applications and optimize your workflows. Selecting the right model is essential to ensure the success of your enterprise. In this article, we explore the available Document Intelligence models and provide guidance for how to choose the best solution for your projects.
The following decision charts highlight the features of each Document Intelligence v3.0 supported model and help you choose the best model to meet the needs and requirements of your application.
Important
Be sure to check the language support page for supported language text and field extraction by feature.
Pretrained document-analysis models
Document type | Example | Data to extract | Your best solution |
---|---|---|---|
A generic document. | A contract or letter. | You want to primarily extract written or printed text lines, words, locations, and detected languages. | Read OCR model |
A document that includes structural information. | A report or study. | In addition to written or printed text, you need to extract structural information like tables, selection marks, paragraphs, titles, headings, and subheadings. | Layout analysis model |
A structured or semi-structured document that includes content formatted as fields (keys) and values. | A form or document that is a standardized format commonly used in your business or industry like a credit application or survey. | You want to extract fields and values including ones not covered by the scenario-specific prebuilt models without having to train a custom model. | **Layout analysis model with the optional query string parameter features=keyValuePairs enabled ** |
Pretrained scenario-specific models
Document type | Data to extract | Your best solution |
---|---|---|
US W-2 tax form | You want to extract key information such as salary, wages, and taxes withheld. | US tax W-2 model |
US Tax 1098 form | You want to extract mortgage interest details such as principal, points, and tax. | US tax 1098 model |
US Tax 1098-E form | You want to extract student loan interest details such as lender and interest amount. | US tax 1098-E model |
US Tax 1098T form | You want to extract qualified tuition details such as scholarship adjustments, student status, and lender information. | US tax 1098-T model |
Contract (legal agreement between parties). | You want to extract contract agreement details such as parties, dates, and intervals. | Contract model |
Health insurance card or health insurance ID. | You want to extract key information such as insurer, member ID, prescription coverage, and group number. | Health insurance card model |
Invoice or billing statement. | You want to extract key information such as customer name, billing address, and amount due. | Invoice model |
Receipt, voucher, or single-page hotel receipt. | You want to extract key information such as merchant name, transaction date, and transaction total. | Receipt model |
Identity document (ID) like a U.S. driver's license or international passport. | You want to extract key information such as first name, last name, date of birth, address, and signature. | Identity document (ID) model |
Mixed-type document(s) with structured, semi-structured, and/or unstructured elements. | You want to extract key-value pairs, selection marks, tables, signature fields, and selected regions not extracted by prebuilt or general document models. | Custom model |
Tip
- If you're still unsure which pretrained model to use, try the layout model with the optional query string parameter
features=keyValuePairs
enabled. - The layout model is powered by the Read OCR engine to detect pages, tables, styles, text, lines, words, locations, and languages.
Custom extraction models
Training set | Example documents | Your best solution |
---|---|---|
Structured, consistent, documents with a static layout. | Structured forms such as questionnaires or applications. | Custom template model |
Structured, semi-structured, and unstructured documents. | ● Structured → surveys ● Semi-structured → invoices ● Unstructured → letters |
Custom neural model |
A collection of several models each trained on similar-type documents. | ● Supply purchase orders ● Equipment purchase orders ● Furniture purchase orders All composed into a single model. |
Composed custom model |
Next steps
Feedback
Submit and view feedback for