Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Important
This feature is in Public Preview and is HIPAA compliant.
Document Parsing uses state-of-the-art research techniques to extract and visualize structured data from a wide range of document types, including but not limited to PDFs, images, Word documents (DOC/DOCX), and PowerPoint files (PPT/PPTX). It's designed to handle complex layouts such as tables, charts, and mixed text-image content.
Document Parsing is built on the ai_parse_document function and includes a UI that allows you to parse documents and immediately inspect their structure through formatted text or structured JSON outputs.
Requirements
- A workspace that includes the following:
- AI ParseDocument Public Preview enabled. See Manage Azure Databricks previews.
- Serverless compute enabled. See Serverless compute requirements.
- Unity Catalog enabled. See Enable a workspace for Unity Catalog.
- Access to a serverless budget policy with a nonzero budget.
- This function is only available in some regions, see AI function availability.
- The
ai_parse_documentfunction is also available to workspaces with the Enhanced Security and Compliance add-on, but workspace administrators must enable it in the Previews portal.
- The
Parse documents
Use Document Parsing to parse your documents and visualize their structure.
- Go to
Agents in the left navigation pane of your workspace.
- Click Create Agent > Document Parsing.
- Select your source document. You can choose to upload a file or select one from an existing Unity Catalog catalog. Supported formats include: PDF, images, DOC/DOCX, and PPT/PPTX.
- Click Parse document.
Parsing your document can take a few minutes. When complete, Document Parsing shows the source document on the left and the parsed document on the right. You can choose to view the parsed document as Formatted text or Raw JSON.

Process and query results
To view the ai_parse_document query and run it on more documents, click Use Agent and choose either to run the query from the SQL Editor or Notebook. You can edit the query to point to the volume or table your documents live in.
Document Parsing provides a UI interface to the SQL function ai_parse_document. See the ai_parse_document reference page for more advanced examples and details.