What is a Genie space

This page introduces Genie, an Azure Databricks feature that allows business teams to interact with their data using natural language. It uses generative AI tailored to your organization's terminology and data, with the ability to monitor and refine its performance through user feedback.

Overview

Domain experts, such as data analysts, configure Genie spaces with datasets, sample queries, and text guidelines to help Genie translate business questions into analytical queries. After setup, business users can ask questions and generate visualizations to understand operational data. You can continuously update Genie's semantic knowledge as your data changes and users pose new questions. For additional information about Databricks AI-powered features, see Databricks AI assistive features.

Genie selects relevant names and descriptions from annotated tables and columns to convert natural language questions to an equivalent SQL query. Then, it responds with the generated query and results table, if possible. If Genie can't generate an answer, it can ask follow-up questions to clarify before providing a response.

Example use cases

You can create different Genie spaces to serve various non-technical audiences. The following scenarios describe two possible use cases.

Example 1: Visualize top selling product

A sales manager wants to understand the top selling product over time in their bakery. They can interact with the Genie space using natural language and automatically generate a visualization.

The following GIF shows this interaction:

Gif with sample question, response, and auto-generated visualization

Example 2: Tracking logistics

A logistics company wants to use Genie spaces to help business users from different departments track operational and financial details. They set up a Genie space for their shipment facility managers to track shipments and another for their financial executives to understand their financial health.

What data should I use?

A Genie space is based on data registered to Unity Catalog, including managed tables, external tables, foreign tables, views, metric views, and materialized views. Genie uses the metadata attached to Unity Catalog objects, as well as an author-curated space-level knowledge store, to generate responses. Well-annotated datasets, paired with specific instructions that you provide, are key to creating a positive experience for end users.

Note

Genie works with structured data only. It cannot answer questions about unstructured data such as PDFs, Word documents, or other file-based content. To give Genie access to unstructured documents, use Chat in Databricks One, which can connect to external document sources such as Google Drive or SharePoint.

File uploads

Important

This feature is in Public Preview.

File uploads allow users to blend their local CSV and Excel files with Unity Catalog data to answer questions. To enable file uploads, contact your Databricks account team. For more information, see Upload a file.

How Genie works

Genie uses a compound AI system to interpret business questions and generate answers. Instead of using a single large language model, compound AI systems process tasks in AI applications by combining multiple interacting components. Compound AI systems are an increasingly common design pattern for AI applications because of their performance and flexibility. For more information, see The Shift from Models to Compound AI Systems.

Language support

You can use Genie in languages other than English, such as Portuguese and French. However, the underlying agent framework wraps prompts in English.

Databricks recommends that space creators add as much metadata as possible in their language of choice. Genie responses might sometimes appear in English due to the underlying system prompts.

What is Genie's knowledge store?

Genie's knowledge store allows authors to:

Edit metadata locally: Genie authors can add space-specific metadata to data assets. For example, it can include company-specific information relevant to how the space is used. This includes table and column metadata descriptions, column-level synonyms, and prompt matching capabilities, which Genie consults when generating answers. A detailed metadata layer helps Genie retrieve the correct information and produce more accurate results.
Provide structured, fine-grained instructions: Authors can define JOIN relationships between tables, to teach Genie how to author SQL across multiple tables.

See Build a knowledge store for more reliable Genie spaces.

How does Genie generate a response?

When a user submits a question, Genie parses the request, identifies relevant data sources, and determines how to generate an appropriate response. Details provided by authors, combined with relevant Unity Catalog comments, metadata, and sample values from selected columns, allow Genie to infer both business and technical logic. For more information, see Databricks AI assistive features trust and safety and prompt matching. Genie intelligently filters example SQL queries, table and column metadata, and chat history to select the most relevant context for answering the request.

Genie generates responses using components such as the following:

Unity Catalog table metadata: Includes table names, descriptions, and defined primary key (PK) and foreign key (FK) relationships. Genie uses this data as it parses the request and converts the natural language prompt to SQL.
Column names and descriptions: Genie intelligently filters for relevant column names and descriptions to include.
Knowledge store context: Authors can locally edit asset metadata and choose columns that provide relevant values to Genie. This helps Genie generate more accurate responses and doesn't alter existing Unity Catalog metadata. See Build a knowledge store for more reliable Genie spaces.
Example SQL queries: Genie intelligently selects relevant SQL examples from SQL Queries.
SQL functions: All SQL functions that have been added in the space.
Instructions: The plain-text notes provided as General instructions are included as context.
Prompt and responses history: Prompts and responses from the current chat are included as context. If necessary, because of set token limits, the oldest parts of the chat record are excluded.

Note

Some table details, such as the owner and table size, are not included by default. To access this information, use views from the information schema available for all Unity Catalog catalogs. Default views might include unnecessary details, so creating a custom view on top of that can help focus on the specific information you need. For more information about what's available in the information schema, see Information schema.

In many cases, Genie generates a SQL query that runs on the space's SQL warehouse. Generated queries are always read-only. Retries are handled automatically, and the SQL warehouse handles concurrency and scale. The result set is presented as part of the response.

Genie maintains strong security and privacy controls. For details, see Databricks AI assistive features trust and safety.

Improve response accuracy using Inspect

Important

This feature is in Public Preview.

Inspect uses advanced reasoning to review and improve the accuracy of Genie's generated SQL queries. When you enable Inspect for a response, Genie:

Reviews the initially generated SQL query.
Authors smaller SQL statements to verify specific aspects of the query, such as:
- Confirming the correct filter values are included.
- Validating date range logic, such as trailing 7-day windows.
- Checking join conditions and aggregations.
Identifies gaps or potential issues in the original query.
If issues are identified, generates an improved SQL query that resolves them.
Performs a final comparison between the original and improved queries.
Returns the query that most accurately answers your question.

Use Inspect when you want additional confidence in query accuracy, especially for complex queries involving filters, date ranges, or multiple tables.

Set up a Genie space

You can create a Genie space if you have the following:

The Databricks SQL entitlement.
At least CAN USE permission on a pro or serverless SQL warehouse.
At least SELECT privileges on one or more Unity Catalog data objects.

See Set up and manage a Genie space.

Companion Genie spaces for AI/BI dashboards (Public Preview)

You can use natural language prompts to generate visualizations for AI/BI dashboards with Genie Code. See Use Genie Code for dashboard authoring.

When you create a dashboard, Databricks automatically creates a companion Genie space that allows business users to conduct self-serve data analytics using natural language. See Genie spaces with dashboards.

Interact with a Genie space

Business teams are the end users for a Genie space. To use a Genie space, business users must have:

The consumer access or Databricks SQL entitlement.
At least SELECT privileges on all of the Unity Catalog data objects used in the space. Users only see data they have permission to access.

Queries run using the compute credentials embedded by the author who configured the warehouse. End users do not need direct warehouse permissions.

Business users can help curate a space by testing it and providing feedback during development. To learn more about how business users can start working with a Genie space, see Use a Genie space to explore business data.

Trusted assets

Trusted assets convey an extra layer of assurance in the accuracy of a result to a space user. When the exact text of a parameterized example query or SQL function is used to generate a response, Genie marks the response as Trusted. See Trusted assets to learn more about trusted assets and working with parameterized queries.

Evaluate responses with benchmarks

Benchmarks allow you to scale up testing and evaluation of individual responses in a Genie space. Unlike instructions, benchmarks are meant to evaluate, not inform, your Genie space. Genie does not use benchmark questions or example SQL to improve Genie's context.

Using benchmarks, you can run a collection of test questions and use the responses to measure Genie's accuracy. Optionally, you can include a SQL statement that returns the expected results. When the benchmark question runs, Genie's response is compared to the results provided by the SQL statement and scored for accuracy. The question is marked for review if no SQL answer has been provided.

See Use benchmarks in a Genie space.

How data access works

Data access in a Genie space is governed by Unity Catalog. When a user asks a question, the generated SQL query runs against the data using the compute credentials embedded by the space author (the configured SQL warehouse). Each user's own Unity Catalog data permissions are applied to the query results. Users only see data they are authorized to access. Any question about data they cannot access returns an empty response.

This means:

You do not need to grant users direct warehouse permissions.
Row filters and column masks defined in Unity Catalog are automatically enforced per user.
To implement per-user data filtering, apply row-level security to the underlying tables in Unity Catalog. See Row filters and column masks.

For information about setting up user permissions for a Genie space, see Share a Genie space.

Privacy and security

Data access in a Genie space is governed by Unity Catalog, including any row filters and column masks that have been applied to your tables. See Data access control and Row filters and column masks.

For other privacy and security FAQs, see the privacy and security FAQs for AI assistive features.

Additional resources

To use the Genie API to integrate Genie into applications, chatbots, and agent frameworks, see Use the Genie API to integrate Genie into your applications.
To use audit logs to track Genie activity and usage, see Audit logs for AI/BI.
For best practices and troubleshooting, see Curate an effective Genie space.

Feedback

Was this page helpful?

Last updated on 2026-04-22