What's new and planned for Synapse Data Warehouse in Microsoft Fabric

Important

The release plans describe functionality that may or may not have been released yet. The delivery timelines and projected functionality may change or may not ship. Refer to Microsoft policy for more information.

Synapse Data Warehouse in Microsoft Fabric is the first data warehouse that supports multi-table transactions and natively embraces an open data format. The warehouse is built on the robust SQL Server query optimizer and an enterprise grade distributed query processing engine that eliminates the need for configuration and management. Synapse Data Warehouse in Microsoft Fabric seamlessly integrates with Data Factory for data ingestion, Power BI for analysis and reporting, and Synapse Spark for data science and machine learning. It streamlines an organization's analytics investments by converging data lakes and warehouses.

Data warehousing workloads benefit from the rich capabilities of the SQL engine over an open data format, enabling customers to focus on analysis and reporting. They also benefit from accessing data from OneLake, a data lake storage virtualization service.

To learn more, see the documentation and visit the announcement blog.

Investment areas

Feature Estimated release timeline
Case insensitive collation support Q2 2024
ALTER TABLE - add nullable column Q3 2024
TRUNCATE Q3 2024
Result set caching Q3 2024
Automatic statistics enhancements Q3 2024
Workload execution insights Q3 2024
Query insights updates Q3 2024
Nested CTE Q3 2024
Time travel Q3 2024
Warehouse monitoring experience Q3 2024
In-place restore within warehouse editor Q2 2024
COPY INTO support for secure storage Q2 2024
Notebook integration Q3 2024
Mirroring GA Q4 2024
Copilot - public preview Q2 2024
VARCHAR(MAX)/VARBINARY(MAX) types - public preview Q3 2024
String performance improvements - public preview Q3 2024

Case insensitive collation support

Estimated release timeline: Q2 2024

Using the public REST APIs to create a Data Warehouse includes a new option to set the default collation.  This can be used to set a new Case Insensitive Collation default.  You'll also be able to use the COLLATE command with CREATE TABLE to directly control which collation your VARCHAR fields use.  The two supported collations are Latin1_General_100_CI_AS_KS_WS_SC_UTF8 (which is Case Insensitive) and Latin1_General_100_BIN2_UTF8 (which is Case Sensitive) and continues to be our default.

ALTER TABLE - Add nullable column

Estimated release timeline: Q3 2024

Support for ALTER TABLE ADD COLUMN to be able to extend already existing tables with new columns that allow NULL values.

TRUNCATE

Estimated release timeline: Q3 2024

The TRUNCATE command quickly removes all rows of data from a table.

Result set caching

Estimated release timeline: Q3 2024

Result set caching saves the results of applicable queries and returns them immediately on subsequent runs, which drastically reduce execution time by bypassing recompilation and recomputation. Cache is automatically managed and requires no manual intervention.

Automatic statistics enhancements

Estimated release timeline: Q3 2024

Various enhancements are planned – shorter execution time of automatic statistic updates, opportunistic support for VARCHAR(MAX) column types, improved storage of intermediate statistic steps, automatic statistics maintenance outside of user query.

Workload execution insights

Estimated release timeline: Q3 2024

This new feature gives users more insight into how a query could perform before they run a query. Users get information on the key steps the query executes, how similar queries have done in the past and the overall condition of the warehouse that could impact your workload.

Query insights updates

Estimated release timeline: Q3 2024

A historic view of your closed sessions will be made available via Query Insights. This addition it helps you analyze traffic, load, and usage of your DW.

Nested CTE

Estimated release timeline: Q3 2024

Common Table Expressions (CTE) increases the readability and simplification for complex queries by deconstructing ordinarily complex queries into simple blocks to be used and reused if necessary, instead of rewriting the query.   A nested CTE is defined with the definition of another CTE.

Time travel

Estimated release timeline: Q3 2024

The ability to time travel at the T-SQL statement level empowers users to query historical data from various past timeframes by specifying the timestamp only once for the entire query. Time travel helps save significantly on storage costs by using single copy of data present in One Lake for conducting historical trend analysis, troubleshooting, and data reconciliation. Additionally, it also facilitates achieving stable reporting by upholding the data integrity across various tables within the data warehouse.

Warehouse monitoring experience

Estimated release timeline: Q3 2024

Using the built-in warehouse monitoring experience, you can view both live queries and historical queries, monitor, and troubleshoot performance of their end-to-end solution.

In-place restore within warehouse editor

Estimated release timeline: Q2 2024

You can now easily create restore points and restore the warehouse to a known good state in the event of accidental corruption, using the Warehouse editor experience.

COPY INTO support for secure storage

Estimated release timeline: Q2 2024

You can now ingest data into your Warehouse using COPY INTO from an external Azure storage account that is protected behind a Firewall. 

Notebook integration

Estimated release timeline: Q3 2024

You can start using T-SQL language support within Notebooks which combines the power of Notebooks and SQL within the same experience - enabling intellisense, autocomplete, cross database queries, richer visualizations and the ability to easily collaborate and share using Notebooks.

Mirroring (GA)

Estimated release timeline: Q4 2024

Mirroring enables users to seamlessly connect their data in databases and data warehouses to Microsoft Fabric without any ETL. Data is replicated as Delta tables in OneLake in near real-time and all Fabric experiences of data warehousing, data science, Power BI Direct Lake mode, copilot, etc. will work out-of-the-box. This release supports Snowflake, Azure Cosmos DB and Azure SQL DB.

Copilot - public preview

Estimated release timeline: Q2 2024

Copilot enables developers of any skill level to quickly build and query a warehouse in Fabric. Copilot offers advice and best practices, autocomplete code, help fix and document code, and offer assistance with data prep, modeling, and analysis.

VARCHAR(MAX)/VARBINARY(MAX) types - public preview

Estimated release timeline: Q3 2024

Users can define columns with VARCHAR(MAX)/VARBINARY(MAX) types in Data warehouse to store string or binary data up to 1 MB. In SQL endpoint for the Lakehouse, the string types in Delta tables are represented as VARCHAR(MAX) without truncation to 8 KB. The performance differences between the queries that are working with VARCHAR(MAX) and VARCHAR(8000) types are minimized, which enables users to use large types without significant performance penalty.

String performance improvements - public preview

Estimated release timeline: Q3 2024

Operations on strings (VARCHAR(N)) are common in T-SQL queries. Performance improvements on string functions and operators that are working with strings boosts the performance of the queries that use LIKE predicates, string functions and comparison operators in WHERE predicates, and operators like GROUP BY, ORDER BY, JOIN that are working with string types.