What is Azure AI Document Intelligence?

Article
08/28/2024

Important

Document Intelligence public preview releases provide early access to features that are in active development. Features, approaches, and processes may change before General Availability (GA), based on user feedback.
The public preview versions of the Document Intelligence client libraries default to REST API version 2024-07-31-preview.
The custom generative (document field extraction) model in AI Studio is available only in the North Central US region. Public preview version 2024-07-31-preview is currently available only in the following Azure regions:
- East US
- West US2
- West Europe
- North Central US
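
As a small illustration of the region constraints above, the following sketch picks an API version for a given region. It's a hypothetical helper, not part of any SDK: the function names are invented for this example, the region identifiers assume the usual lowercase Azure location spelling, and the fallback to the 2023-07-31 GA version is an assumption about how an application might choose to behave.

```python
# Regions listed above, written as lowercase Azure location identifiers (assumed spelling).
PREVIEW_REGIONS = {"eastus", "westus2", "westeurope", "northcentralus"}
CUSTOM_GENERATIVE_REGIONS = {"northcentralus"}

PREVIEW_API_VERSION = "2024-07-31-preview"
GA_API_VERSION = "2023-07-31"  # GA release named in this article


def normalize(region):
    """Turn a display name such as 'West US2' into 'westus2'."""
    return region.replace(" ", "").lower()


def pick_api_version(region, want_preview=True):
    """Hypothetical policy: use the preview only where it's available."""
    if want_preview and normalize(region) in PREVIEW_REGIONS:
        return PREVIEW_API_VERSION
    return GA_API_VERSION


def supports_custom_generative(region):
    """Custom generative is limited to North Central US in this preview."""
    return normalize(region) in CUSTOM_GENERATIVE_REGIONS


print(pick_api_version("West US2"))
print(supports_custom_generative("East US"))
```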

This content applies to: v4.0 (preview) | Previous versions: v3.1 (GA) v3.0 (GA) v2.1 (GA)


Note

Form Recognizer is now Azure AI Document Intelligence!

As of July 2023, Azure AI services encompass all of what were previously known as Cognitive Services and Azure Applied AI Services.
There are no changes to pricing.
The names Cognitive Services and Azure Applied AI continue to be used in Azure billing, cost analysis, price list, and price APIs.
There are no breaking changes to application programming interfaces (APIs) or SDKs in versions up to and including v3.1. Starting from v4.0, APIs and SDKs are updated to Document Intelligence.
Some platforms are still awaiting the renaming update. All mentions of Form Recognizer or Document Intelligence in our documentation refer to the same Azure service.

Azure AI Document Intelligence is a cloud-based Azure AI service that enables you to build intelligent document processing solutions. Massive amounts of data, spanning a wide variety of data types, are stored in forms and documents. Document Intelligence enables you to effectively manage the velocity at which data is collected and processed and is key to improved operations, informed data-driven decisions, and enlightened innovation.

| ✔️ Document analysis models | ✔️ Prebuilt models | ✔️ Custom models |

General extraction models

General extraction models enable text extraction from forms and documents and return structured, business-ready content for your organization's action, use, or development.

Read | Extract printed and handwritten text.

Layout | Extract text, tables, and document structure.

General document | Extract text, structure, and key-value pairs.
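
To make the general extraction models more concrete, the following sketch builds (but doesn't send) an analyze request for the Read model. It's a minimal illustration under stated assumptions: the URL path, the `Ocp-Apim-Subscription-Key` header, and the `urlSource` body field reflect the v4.0 preview REST API as commonly documented and aren't specified in this article. The endpoint, key, and document URL are placeholders, and `build_analyze_request` is a hypothetical helper name.

```python
import json
from urllib.parse import quote, urlencode

API_VERSION = "2024-07-31-preview"  # preview version named in this article


def build_analyze_request(endpoint, key, model_id, document_url,
                          api_version=API_VERSION):
    """Return (url, headers, body) for an analyze call; nothing is sent."""
    base = endpoint.rstrip("/")
    # Assumed route for the v4.0 preview REST API.
    path = f"/documentintelligence/documentModels/{quote(model_id, safe='')}:analyze"
    url = f"{base}{path}?{urlencode({'api-version': api_version})}"
    headers = {
        "Ocp-Apim-Subscription-Key": key,  # assumed key-based auth header
        "Content-Type": "application/json",
    }
    body = json.dumps({"urlSource": document_url})
    return url, headers, body


url, headers, body = build_analyze_request(
    "https://example-resource.cognitiveservices.azure.com/",  # placeholder
    "<your-key>",                                             # placeholder
    "prebuilt-read",
    "https://example.com/sample.pdf",                         # placeholder
)
print(url)
```

Swapping `prebuilt-read` for `prebuilt-layout` would target the Layout model instead. Sending the request and polling for the result are left out of this sketch.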

Prebuilt models

Prebuilt models enable you to add intelligent document processing to your apps and flows without having to train and build your own models.

Financial Services and Legal

Bank Statement | Extract account information and details from bank statements.

Check | Extract relevant information from checks.

Contract | Extract agreement and party details.

Credit card | Extract payment card information.

Invoice | Extract customer and vendor details.

Pay Stub | Extract pay stub details.

Receipt | Extract sales transaction details.

US Tax

Unified US tax | Extract from any US tax forms supported.

US Tax W-2 | Extract taxable compensation details.

US Tax 1098 | Extract 1098 variation details.

US Tax 1099 | Extract 1099 variation details.

US Tax 1040 | Extract 1040 variation details.

US Mortgage

US mortgage 1003 | Extract loan application details.

US mortgage 1004 | Extract information from appraisal.

US mortgage 1005 | Extract information from validation of employment.

US mortgage 1008 | Extract loan transmittal details.

US mortgage disclosure | Extract final closing loan terms.

Personal Identification

Health Insurance card | Extract insurance coverage details.

Identity | Extract verification details.

Marriage certificate | Extract certified marriage information.

Business card | Extract business contact details.


Custom models

Custom models are trained using your labeled datasets to extract distinct data from forms and documents, specific to your use cases. Standalone custom models can be combined to create composed models.

Document field extraction models

✔️ Document field extraction models are trained to extract labeled fields from documents.

Custom generative | Build a custom extraction model using generative AI for documents with unstructured format and varying templates.

Custom neural | Extract data from mixed-type documents.

Custom template | Extract data from static layouts.

Custom composed | Extract data using a collection of models.

Custom classification models

✔️ Custom classifiers identify document types before invoking an extraction model.

Custom classifier | Identify designated document types (classes) before invoking an extraction model.
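
The classify-then-extract pattern described above can be sketched as a simple routing step. This is an illustration only: the document type labels, the confidence threshold, and the fallback to `prebuilt-layout` are hypothetical choices for this example, and the classifier result is represented by plain values rather than an SDK response object.

```python
# Hypothetical mapping from a classifier's document type to an extraction model ID.
# The model IDs appear in this article; the type labels are invented for the example.
ROUTES = {
    "invoice": "prebuilt-invoice",
    "receipt": "prebuilt-receipt",
    "w2": "prebuilt-tax.us.w2",
}


def choose_model(doc_type, confidence, min_confidence=0.8,
                 fallback="prebuilt-layout"):
    """Pick the extraction model to invoke after classification.

    Low-confidence or unknown types fall back to a general model
    (an assumed policy, not a service behavior).
    """
    if confidence < min_confidence:
        return fallback
    return ROUTES.get(doc_type, fallback)


print(choose_model("invoice", 0.95))
print(choose_model("invoice", 0.40))
```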

Add-on capabilities

Document Intelligence supports optional features that can be enabled and disabled depending on the document extraction scenario. The following add-on capabilities are available for 2023-07-31 (GA) and later releases:

The 2024-07-31-preview release introduces read model support for searchable PDF output:

- Searchable PDF
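
Add-on capabilities are requested per analyze call. The following is a minimal sketch, assuming they're passed as a comma-separated `features` query parameter and that searchable PDF output is requested with `output=pdf`. Those parameter names and the feature identifiers aren't given in this article and should be checked against the REST API reference. The premium flags mirror the starred columns in the analysis features table in this article, and both helper names are hypothetical.

```python
from urllib.parse import urlencode

# Table column -> (assumed `features` value, premium flag per the * legend).
ADD_ONS = {
    "High Resolution": ("ocrHighResolution", True),
    "Formulas": ("formulas", True),
    "Style Font": ("styleFont", True),
    "Barcodes": ("barcodes", False),
    "Languages": ("languages", False),
    "Key-Value Pairs": ("keyValuePairs", False),
    "Query fields": ("queryFields", False),
}


def build_query(api_version, add_ons=(), searchable_pdf=False):
    """Build the query string for an analyze call with optional add-ons."""
    params = {"api-version": api_version}
    if add_ons:
        params["features"] = ",".join(ADD_ONS[name][0] for name in add_ons)
    if searchable_pdf:
        params["output"] = "pdf"  # assumed parameter for searchable PDF
    # Keep commas literal so the feature list stays readable.
    return urlencode(params, safe=",")


def premium_add_ons(add_ons):
    """Return the requested add-ons that the table marks as premium."""
    return [name for name in add_ons if ADD_ONS[name][1]]


print(build_query("2024-07-31-preview", ["Formulas", "Barcodes"], searchable_pdf=True))
print(premium_add_ons(["Formulas", "Barcodes"]))
```

Checking `premium_add_ons` before sending a request is one way to flag calls that incur extra cost.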

Analysis features

Model ID	Content Extraction	Query fields	Paragraphs	Paragraph Roles	Selection Marks	Tables	Key-Value Pairs	Languages	Barcodes	Document Analysis	Formulas*	Style Font*	High Resolution*	Searchable PDF
prebuilt-read	✓						O	O		O	O	O		✓
prebuilt-layout	✓	✓	✓	✓	✓	✓		O	O		O	O	O
prebuilt-document	✓	✓	✓	✓	✓	✓	✓	O	O		O	O	O
prebuilt-businessCard	✓	✓								✓
prebuilt-contract	✓	✓	✓	✓			O	O	✓	O	O	O
prebuilt-healthInsuranceCard.us	✓	✓						O	O	✓	O	O	O
prebuilt-idDocument	✓	✓						O	O	✓	O	O	O
prebuilt-invoice	✓	✓			✓	✓	O	O	O	✓	O	O	O
prebuilt-receipt	✓	✓						O	O	✓	O	O	O
prebuilt-marriageCertificate.us	✓	✓						O	O	✓	O	O	O
prebuilt-creditCard	✓	✓						O	O	✓	O	O	O
prebuilt-check.us	✓	✓						O	O	✓	O	O	O
prebuilt-payStub.us	✓	✓						O	O	✓	O	O	O
prebuilt-bankStatement	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.1003	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.1004	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.1005	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.1008	✓	✓						O	O	✓	O	O	O
prebuilt-mortgage.us.closingDisclosure	✓	✓						O	O	✓	O	O	O
prebuilt-tax.us	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.w2	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1098	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1098E	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1098T	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1099(variations)	✓	✓			✓			O	O	✓	O	O	O
prebuilt-tax.us.1040(variations)	✓	✓						O	O	✓	O	O	O
{ customModelName }	✓	✓	✓	✓	✓	✓		O	O	✓	O	O	O

✓ - Enabled
O - Optional
* - Premium features incur extra costs

Models and development options

Note

The following document understanding models and development options are supported by the Document Intelligence service v3.0.

You can use Document Intelligence to automate document processing in applications and workflows, enhance data-driven strategies, and enrich document search capabilities. Use the links in the table to learn more about each model and browse development options.

Read

Screenshot of Read model analysis using Document Intelligence Studio.

Model ID	Description	Automation use cases	Development options
prebuilt-read	● Extract text from documents. ● Data extraction	● Digitizing any document. ● Compliance and auditing. ● Processing handwritten notes before translation.	● Document Intelligence Studio ● REST API ● C# SDK ● Python SDK ● Java SDK ● JavaScript

Model type	Model name
Document analysis model	● Layout analysis model
Prebuilt models	● Invoice model ● Receipt model ● Identity document (ID) model ● Business card model
Custom models	● Custom model ● Composed model

Model	Description	Development options
Layout analysis	Extraction and analysis of text, selection marks, tables, and bounding box coordinates, from forms and documents.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Custom model	Extraction and analysis of data from forms and documents specific to distinct business data and use cases.	● Document Intelligence labeling tool ● REST API ● Sample Labeling Tool ● Document Intelligence Docker container
Invoice model	Automated data processing and extraction of key information from sales invoices.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Receipt model	Automated data processing and extraction of key information from sales receipts.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Identity document (ID) model	Automated data processing and extraction of key information from US driver's licenses and international passports.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
Business card model	Automated data processing and extraction of key information from business cards.	● Document Intelligence labeling tool ● REST API ● Client-library SDK ● Document Intelligence Docker container
