Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Important
- Azure AI Content Understanding is available in preview. Public preview releases provide early access to features that are in active development.
- Features, approaches, and processes can change or have limited capabilities, before General Availability (GA).
- For more information, see Supplemental Terms of Use for Microsoft Azure Previews.
Azure AI Content Understanding offers sophisticated document analysis capabilities, enabling organizations to convert unstructured content into actionable and organized data. Utilizing customizable analyzers, it can expertly extract essential information, fields, and relationships from a diverse range of documents and forms.
Business use cases
Document analyzers can process complex documents in various formats and templates:
- Contract lifecycle management: Extract key fields, clauses, and obligations from various contract types.
- Loan and mortgage applications: Automate processing to enable quicker handling by banks, lenders, and government entities.
- Financial services: Analyze complex documents like financial reports and asset management reports.
- Expense management: Parse receipts and invoices from various retailers to validate expenses across different formats and templates.
- Document sets and knowledge base scenarios: Extract key fields from document sets as a whole, and add reference data that handle tasks like validation and enrichment by applying multi-step reasoning.
Document analyzer capabilities
Content extraction
Content extraction forms the foundation of Azure AI Content Understanding's document analysis capabilities, transforming unstructured documents into structured, machine-readable data. It precisely captures both printed and handwritten text while preserving the document's structure through advanced layout analysis.
- Content Analysis
- Text: Processes multilingual content, including both machine-printed and handwritten text from hundreds of languages.
- Selection marks: Identifies and extracts selection indicators such as checkboxes, radio buttons, and similar markers.
- Barcode detection: Scans and decodes information from over a dozen types of linear and two-dimensional barcodes.
- Mathematical formulas: Captures and preserves complex mathematical expressions in LaTeX format.
- Image elements: Locates and extracts images, diagrams, and charts along with their related captions and annotations.
- Structure Analysis
- Paragraphs: Detects and categorizes text segments based on their document context and role.
- Tabular data: Recognizes and extracts table structures, including complex formats with spanning cells and multi-page layouts.
- Hierarchical sections: Maps content organization through section headers and nested content relationships.
- RAG
- RAG solutions: Content extraction forms the foundation of effective RAG systems by transforming raw multimodal data into structured, searchable formats optimized for retrieval. Learn more about building RAG solutions on our retrieval-augmented generation page.
Field extraction
Field extraction empowers you to extract, classify, and generate structured data from various documents and forms, customized to meet your unique requirements. The process of transforming unstructured content into organized, actionable information simplifies data management, improves searchability, and supports automated workflows. For instance, you can seamlessly extract customer details, billing addresses, and itemized charges from invoices, or identify contractual parties, renewal dates, and payment terms in legal agreements. To maximize efficiency, you can utilize prebuilt analyzer templates—such as ones tailored for invoices—or design bespoke analyzers from scratch, enhancing precision through the labeling of more sample documents.
The confidence and grounding API is an opt-in feature. Set estimateFieldSourceAndConfidence
as true
to opt in for confidence and grounding for field extraction.
Field extraction methods
Azure AI Content Understanding provides versatile methods for field extraction, enabling precise and tailored processing of document content:
Extract: Extract specific data, like transaction dates from receipts or line items from invoices, for precise and focused information capture.
Classify: Categorize document content into predefined categories, such as classifying sentiment in customer call transcript or classifying hotel receipt items.
Generate: Produce new insights or summaries from your documents, including document summaries, and chapter overviews enhancing content accessibility and comprehension.
Key benefits
Content Understanding delivers powerful document analysis capabilities designed to address critical enterprise and business scenarios such as Retrieval-Augmented Generation (RAG) and Robotic Process Automation (RPA). Key benefits include:
Intelligent search enablement: Transform unstructured documents into structured, searchable data assets, significantly improving information discoverability and accessibility across your organization.
Grounded data extraction: Maintain clear traceability and localization of extracted data, facilitating efficient human-in-the-loop review processes and ensuring transparency and compliance.
Confidence-driven automation: Utilize built-in confidence scoring to intelligently automate document processing tasks, optimizing resource allocation, reducing operational costs, and enhancing decision-making accuracy.
Flexible customization: Easily adapt and tailor document analyzers to align with specific business processes and workflows, enabling precise extraction and classification tailored to your organization's unique requirements.
Enhanced accuracy and reliability: Achieve precise extraction and classification of critical business data, significantly reducing errors and improving operational efficiency across automated workflows.
Agents-ready: Process your diverse input and deliver output in a standard format that's ready for an agent's workflow. Outputs can give your application an understanding of user intent, with data supported by a
strongly-typed
schema that makes it easier to quickly get data in a format ready for your code.
Input requirements
For detailed information on supported input document formats, refer to our Service quotas and limits page.
Supported languages and regions
For a detailed list of supported languages and regions, visit our Language and region support page.
Data privacy and security
Developers using Content Understanding should review Microsoft's policies on customer data. For more information, visit our Data, protection, and privacy page.
Next step
- Try processing your document content using Content Understanding in Azure AI Foundry.
- Learn to analyze document content analyzer templates.
- Review code samples: visual document search.