You can create a one logical copy of KQL database data in an eventhouse by turning on OneLake availability. Turning on OneLake availability means that you can query the data in your KQL database in Delta Lake format via other Fabric engines such as Direct Lake mode in Power BI, Warehouse, Lakehouse, Notebooks, and more.
Delta Lake is a unified data lake table format that achieves seamless data access across all compute engines in Microsoft Fabric. For more information on Delta Lake, see What is Delta Lake?.
In this article, you learn how to turn on availability of KQL Database data in OneLake.
How it works
The following table describes the behavior of your KQL database and tables when you turn on or turn off OneLake availability.
Turned on
Turned off
KQL Database
- Existing tables aren't affected. New tables are made available in OneLake. - The Data retention policy of your KQL database is also applied to the data in OneLake. Data removed from your KQL database at the end of the retention period is also removed from OneLake.
- Existing tables aren't affected. New tables won't be available in OneLake.
A table in KQL Database
- New data is made available in OneLake. - Existing data isn't backfilled. - Data can't be deleted, truncated, or purged. - Table schema can't be altered and the table can't be renamed. - Row Level Security can't be applied to the table.
- New data isn't made available in OneLake. - Data can be deleted, truncated, or purged. - Table schema can be altered and the table can be renamed. - Data is soft deleted from OneLake.
Important
It can take up to a few hours for the data to appear in OneLake. For more information, see Adaptive behavior.
There's no additional storage cost to turn on OneLake availability. For more information, see resource consumption.
You can turn on OneLake availability either on a KQL database or table level.
To turn on OneLake availability, browse to the OneLake section in the details pane of your KQL database or table.
Set Availability to On.
The database refreshes automatically.
You turned on OneLake availability in your KQL database. You can now access all the new data added to your database at the given OneLake path in Delta Lake format. You can also choose to create a OneLake shortcut from a Lakehouse, Data Warehouse, or query the data directly via Power BI Direct Lake mode.
Adaptive behavior
Eventhouse offers a robust mechanism that intelligently batches incoming data streams into one or more Parquet files, structured for analysis. Batching data streams is important when dealing with trickling data. Writing many small Parquet files into the lake can be inefficient resulting in higher costs and poor performance.
Eventhouse's adaptive mechanism can delay write operations for up to a few hours if there isn’t enough data to create optimal Parquet files. This ensures Parquet files are optimal in size and adhere to Delta Lake best practices. The Eventhouse adaptive mechanism ensures that the Parquet files are primed for analysis and balances the need for prompt data availability with cost and performance considerations.
Results are measured from the last time data was added. When Latency results in 00:00:00, all the data in the KQL database is available in OneLake.
View files
When you turn on OneLake availability on a table, a delta log folder is created along with any corresponding JSON and Parquet files. You can view the files that were made available in OneLake and their properties while remaining within Real-Time Intelligence.
To view the files, hover over a table in the Explorer pane and then select the More menu [...] > View files.
To view the properties of the delta log folder or the individual files, hover over the folder or file and then select the More menu [...] > Properties.
To view the files in the delta log folder:
Select the _delta_log folder.
Select a file to view the table metadata and schema. The editor that opens is in read-only format.
If you turn off OneLake availability, the mirroring policy's IsEnabled property is set to false (IsEnabled=false).
Partition delta tables
You can partition your delta tables to improve query speed. For information about when to partition your OneLake files, see When to partition tables. Each partition is represented as a separate column using the PartitionName listed in the Partitions list. This means your OneLake copy has more columns than your source table.
Tables in a Microsoft Fabric lakehouse are based on the Delta Lake technology commonly used in Apache Spark. By using the enhanced capabilities of delta tables, you can create advanced analytics solutions.
Administer an SQL Server database infrastructure for cloud, on-premises and hybrid relational databases using the Microsoft PaaS relational database offerings.