Serverless compute release notes

This article explains the features and behaviors that are currently available and upcoming on serverless compute for notebooks and jobs.

For more information on serverless compute, see Connect to serverless compute.

Azure Databricks periodically releases updates to serverless compute, automatically upgrading the serverless compute runtime to support enhancements and upgrades to the platform. All users get the same updates, rolled out over a short period of time.

Serverless environment versions

Serverless compute for notebooks and jobs uses environment versions, which provide a stable client API based on Spark Connect to ensure application compatibility. This allows Databricks to upgrade the server independently, delivering performance improvements, security enhancements, and bug fixes without requiring any code changes to workloads.

Each environment version includes a specific Python version and a set of Python packages with defined versions. Databricks introduces new features and fixes in the latest environment version while applying security updates to all supported environment versions.

For serverless environment version release notes, see Serverless environment versions.

Release notes

This section includes release notes for serverless compute. Release notes are organized by year and week of year. Serverless compute always runs using the most recently released version listed here.

Serverless version 18.0
Version 18.0
Serverless environment version 5 is now available
Version 17.3
Version 17.2
Version 17.1
Serverless environment version 4
Version 17.0
Serverless performance targets is GA
Version 16.4
Performance mode is now configurable on serverless jobs
Version 16.3
Version 16.2
High memory setting available on serverless notebooks (Public Preview)
Version 16.1
Version 15.4
The JDK is upgraded from JDK 8 to JDK 17
Version 15.1
Version 14.3

Serverless version 18.0

February 27, 2026

Version 18.0

February 27, 2026

This serverless compute release roughly corresponds to Databricks Runtime 18.0.

New features

SQL scripting is now generally available: SQL scripting is now generally available.
Redshift JDBC driver upgraded to 2.1.0.28: The Redshift JDBC driver has been upgraded to version 2.1.0.28.
Shared isolation execution environment for Unity Catalog Python UDFs: Unity Catalog Python UDFs with the same owner can now share an isolation environment by default, improving performance and reducing memory usage. To ensure a UDF always runs in a fully isolated environment, add the STRICT ISOLATION characteristic clause. See Environment isolation.
SQL window functions in metric views: You can now use SQL window functions in metric views to calculate running totals, rankings, and other window-based calculations.
Dynamic shuffle partition adjustment in stateless streaming queries: You can now change the number of shuffle partitions in stateless streaming queries without restarting the query.
Adaptive Query Execution and auto-optimized shuffle in stateless streaming queries: Adaptive Query Execution (AQE) and auto-optimized shuffle (AOS) are now supported in stateless streaming queries.
Literal string coalescing everywhere: The ability to coalesce sequential string literals such as 'Hello' ' World' into 'Hello World' has been expanded to any place string literals are allowed. See STRING type.
Parameter markers everywhere: You can now use named (:param) and unnamed (?) parameter markers virtually anywhere a literal value of the appropriate type can be used, including DDL statements and column types. See Parameter markers.
IDENTIFIER clause everywhere: The IDENTIFIER clause, which casts strings to SQL object names, has been expanded to nearly everywhere an identifier is permitted, including column aliases and column definitions. See IDENTIFIER clause.
New BITMAP_AND_AGG function: A new bitmap_and_agg aggregate function function is now available.
New Theta sketch functions: A new library of functions for approximate distinct count and set operations using Datasketches Theta Sketch is now available, including theta_sketch_agg, theta_union_agg, theta_intersection_agg, and related functions.
New KLL Sketch function library: A new library of functions for building KLL Sketches for approximate quantile computation is now available, including kll_sketch_agg_bigint, kll_sketch_agg_double, and related functions.
Apache Parquet library upgraded to 1.16.0: The Apache Parquet library has been upgraded to version 1.16.0.
New geospatial functions: The following new geospatial functions are now supported:
- st_azimuth function: Returns the north-based azimuth from the first point to the second in radians.
- st_boundary function: Returns the boundary of the input geometry.
- st_closestpoint function: Returns the 2D projection of a point on the first geometry closest to the second geometry.
- st_geogfromewkt function: Parses an Extended Well-Known Text (EWKT) description of a geography.
- st_geomfromewkt function: Parses an Extended Well-Known Text (EWKT) description of a geometry.
Improved spatial join performance: Spatial join performance is now improved by introducing shuffled spatial join support.
Improved performance for geospatial functions: Photon implementations are now available for st_isvalid function, st_makeline function, and st_makepolygon function.
EWKT input support: The try_to_geography, try_to_geometry, to_geography, and to_geometry functions now accept Extended Well-Known Text (EWKT) as input.

Behavior changes

FSCK REPAIR TABLE includes metadata repair by default: The FSCK REPAIR TABLE command now includes an initial metadata repair step before checking for missing data files. The command can work on tables with corrupt checkpoints or invalid partition values.
Proration factors aligned between reads and auto-optimized writes: Proration factors for partition sizing now use fractional values consistently across read operations and auto-optimized writes. This change might result in a different number of tasks for read operations.
Python UDF execution unified across PySpark and Unity Catalog: Unity Catalog Python UDFs now use Apache Arrow as the default interchange format, improving overall performance. As part of this change, TIMESTAMP values passed to Python UDFs no longer include timezone information in the datetime object's tzinfo attribute. If your UDF relies on timezone information, restore it with date = date.replace(tzinfo=timezone.utc). See Timestamp timezone behavior for inputs.
Improved error messages for Kafka connector login module issues: When using the Kafka connector with an unshaded login module class, Azure Databricks now provides error messages that suggest using the correct shaded class prefix (kafkashaded.org.apache.kafka or kafkashaded.software.amazon.msk.auth.iam).
Time travel restrictions and VACUUM retention behavior: Azure Databricks now blocks time travel queries beyond the deletedFileRetentionDuration threshold for all tables. The VACUUM command ignores the retention duration argument except when the value is 0 hours. You can't set deletedFileRetentionDuration larger than logRetentionDuration or vice versa.
BinaryType maps to bytes by default in PySpark: In PySpark, BinaryType now consistently maps to Python bytes. Previously, PySpark mapped BinaryType to either bytes or bytearray depending on the context. To restore the old behavior, set spark.sql.execution.pyspark.binaryAsBytes to false.
Partition columns materialized in Parquet files: Partitioned Delta tables now materialize partition columns in newly written Parquet data files. Previously, partition values were stored in the Delta transaction log metadata and reflected in directory paths, but not written as columns in the Parquet files themselves. This change might affect workloads that directly read Parquet files written by Delta.
DESCRIBE TABLE output includes metadata column: The output of DESCRIBE TABLE [EXTENDED] now includes a new metadata column for all table types, containing semantic metadata (display name, format, and synonyms) defined on the table as a JSON string.

Serverless environment version 5 is now available

February 25, 2026

You can now use serverless environment version 5 in your serverless notebooks and jobs. This includes both the CPU and GPU environment versions. See Serverless environment version 5.

Version 17.3

October 28, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.3 LTS.

New features

LIMIT ALL support for recursive CTEs: You can now use the LIMIT ALL clause with recursive common table expressions (rCTEs) to explicitly specify that no row limit should be applied to the query results. See Common table expression (CTE).
Appending to files in Unity Catalog volumes returns correct error: Attempting to append to existing files in Unity Catalog volumes now returns a more descriptive error message to help you understand and resolve the issue.
st_dump function support: You can now use the st_dump function to decompose a geometry object into its constituent parts, returning a set of simpler geometries. See st_dump function.
Polygon interior ring functions are now supported: You can now use the following functions to work with polygon interior rings:
- st_numinteriorrings: Get the number of inner boundaries (rings) of a polygon. See st_numinteriorrings function.
- st_interiorringn: Extract the n-th inner boundary of a polygon and return it as a linestring. See st_interiorringn function.
EXECUTE IMMEDIATE using constant expressions: The EXECUTE IMMEDIATE statement now supports using constant expressions in the query string, allowing for more flexible dynamic SQL execution. See EXECUTE IMMEDIATE.
Allow spark.sql.files.maxPartitionBytes in serverless compute: You can now configure the spark.sql.files.maxPartitionBytes Spark configuration parameter on serverless compute to control the maximum number of bytes to pack into a single partition when reading files. See Configure Spark properties for serverless notebooks and jobs.

Behavior changes

Support MV/ST refresh information in DESCRIBE EXTENDED AS JSON: The DESCRIBE EXTENDED AS JSON command now includes refresh information for materialized views and streaming tables, providing visibility into the last refresh time and status.
Add metadata column to DESCRIBE QUERY and DESCRIBE TABLE: The DESCRIBE QUERY and DESCRIBE TABLE commands now include a metadata column in their output, providing additional information about each column's properties and characteristics.
Correct handling of null structs when dropping NullType columns: Azure Databricks now correctly handles null struct values when dropping columns with NullType, preventing potential data corruption or unexpected behavior.
Improved handling of null structs in Parquet: This release includes improvements to how null struct values are handled when reading from and writing to Parquet files, ensuring more consistent and correct behavior.
Upgrade aws-msk-iam-auth library for Kafka: The aws-msk-iam-auth library used for Amazon MSK IAM authentication has been upgraded to the latest version, providing improved security and compatibility.

Version 17.2

September 25, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.2.

New features

ST_ExteriorRing function is now supported: You can now use the ST_ExteriorRing function to extract the outer boundary of a polygon and return it as a linestring. See st_exteriorring function.

Support TEMPORARY keyword for metric view creation: You can now use the TEMPORARY keyword when creating a metric view. Temporary metric views are visible only in the session that created them and are dropped when the session ends. See CREATE VIEW.
Use native I/O for LokiFileSystem.getFileStatus on S3: LokiFileSystem.getFileStatus now uses the native I/O stack for Amazon S3 traffic and returns org.apache.hadoop.fs.FileStatus objects instead of shaded.databricks.org.apache.hadoop.fs.s3a.S3AFileStatus.
Auto Loader infers partition columns in singleVariantColumn mode: Auto Loader now infers partition columns from file paths when ingesting data as a semi-structured variant type using the singleVariantColumn option. Previously, partition columns were not automatically detected. See Auto Loader options.

Behavior changes

DESCRIBE CONNECTION shows environment settings for JDBC connections: Azure Databricks now includes user-defined environment settings in the DESCRIBE CONNECTION output for JDBC connections that support custom drivers and run in isolation. Other connection types remain unchanged.
Option to truncate uniform history during managed tables migration: You can now truncate uniform history when migrating tables with Uniform/Iceberg enabled using ALTER TABLE...SET MANAGED. This simplifies migrations and reduces downtime compared to disabling and re-enabling Uniform manually.
Correct results for split with empty regex and positive limit: Azure Databricks now returns correct results when using split function with an empty regex and a positive limit. Previously, the function incorrectly truncated the remaining string instead of including it in the last element.
Fix url_decode and try_url_decode error handling in Photon: In Photon, try_url_decode() and url_decode() with failOnError = false now return NULL for invalid URL-encoded strings instead of failing the query.
Shared execution environment for Unity Catalog Python UDTFs: Azure Databricks now shares the execution environment for Python user-defined table functions (UDTFs) from the same owner and Spark session. An optional STRICT ISOLATION clause is available to disable sharing for UDTFs with side effects, such as modifying environment variables or executing arbitrary code.

Version 17.1

August 19, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.1.

New features

Reduced memory usage for wide schemas in Photon writer: Enhancements were made to the Photon engine that significantly reduce memory usage for wide schemas, addressing scenarios that previously resulted in out-of-memory errors.

Behavior changes

Error thrown for invalid CHECK constraints: Azure Databricks now throws an AnalysisException if a CHECK constraint expression cannot be resolved during constraint validation.
Pulsar connector no longer exposes Bouncy Castle: The Bouncy Castle library is now shaded in the Pulsar connector to prevent classpath conflicts. As a result, Spark jobs can no longer access org.bouncycastle.* classes from the connector. If your code depends on Bouncy Castle, install the library manually on serverless environment.
Auto Loader uses file events by default if available: Auto Loader uses file events instead of directory listing when the load path is an external location with file events enabled. The default for useManagedFileEvents is now if_available (was false). This can improve ingestion performance and logs a warning if file events are not yet enabled.
Teradata connector fixes case-sensitive string comparison: The Teradata connector now defaults to TMODE=ANSI, aligning string comparison behavior with Azure Databricks by making it case-sensitive. This change is configurable and does not affect existing users unless they opt in.

Serverless environment version 4

August 13, 2025

Environment version 4 is now available in your serverless notebooks and jobs. This environment version includes library upgrades and API updates. See Serverless environment version 4.

Version 17.0

July 24, 2025

This serverless compute release roughly corresponds to Databricks Runtime 17.0.

New features

SQL procedure support: SQL scripts can now be encapsulated in a procedure stored as a reusable asset in Unity Catalog. You can create a procedure using the CREATE PROCEDURE command, and then call it using the CALL command.
Set a default collation for SQL Functions: Using the new DEFAULT COLLATION clause in the CREATE FUNCTION command defines the default collation used for STRING parameters, the return type, and STRING literals in the function body.
Recursive common table expressions (rCTE) support: Azure Databricks now supports navigation of hierarchical data using recursive common table expressions (rCTEs). Use a self-referencing CTE with UNION ALL to follow the recursive relationship.
PySpark and Spark Connect now support the DataFrames df.mergeInto API: PySpark and Spark Connect now support the df.mergeInto API.
Support ALL CATALOGS in SHOW SCHEMAS: The SHOW SCHEMAS syntax is updated to accept ALL CATALOGS, allowing you to iterate through all active catalogs that support namespaces. The output attributes now include a catalog column indicating the catalog of the corresponding namespace.
Liquid clustering now compacts deletion vectors more efficiently: Delta tables with liquid clustering now apply physical changes from deletion vectors more efficiently when OPTIMIZE is running. For more details, see Apply changes to Parquet data files.
Allow non-deterministic expressions in UPDATE/INSERT column values for MERGE operations: Azure Databricks now allows the use of non-deterministic expressions in updated and inserted column values of MERGE operations. For example, you can now generate dynamic or random values for columns using expressions like rand().
Change Delta MERGE Python APIs to return DataFrame instead of Unit: The Python MERGE APIs (such as DeltaMergeBuilder) now also return a DataFrame like the SQL API does, with the same results.

Behavior changes

Behavioral change for the Auto Loader incremental directory listing option: The value of the deprecated Auto Loader cloudFiles.useIncrementalListing option is now set to a default value of false. As a result, this change causes Auto Loader to perform a full directory listing each time it's run. Databricks recommends against using this option. Instead, use file notification mode with file events.
CREATE VIEW column-level clauses now throw errors when the clause would only apply to materialized views: CREATE VIEW commands that specify a column-level clause that is only valid for MATERIALIZED VIEWs now throw an error. The affected clauses include NOT NULL, specified datatypes, DEFAULT, and COLUMN MASK.

Serverless performance targets is GA

June 10, 2025

Selecting the serverless performance setting for jobs and pipelines is now generally available.

When the Performance optimized setting is enabled, your workload is optimized for faster startup and execution time. When disabled, the serverless workload runs on standard performance mode, which is optimized for cost and has a slightly higher launch latency.

For more information, see Select a performance mode and Select a performance mode.

Version 16.4

May 28, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.4 LTS.

Behavior changes

Fix to respect options for data source cached plans: This update ensures table reads respect options set for all data source plans when cached, not just the first cached table read. Previously, data source table reads cached the first plan but failed to account for different options in subsequent queries.
Enable flag to require source materialization for MERGE operations: Previously, users could turn off source materialization in MERGE by setting merge.materializeSource to none. With the new flag enabled, source materialization will always be required, and attempts to disable it will result in an error. Databricks plans to enable this flag only for customers who have not previously changed this configuration, so most users should not experience any change in behavior.

New features

Auto Loader can now clean processed files in the source directory: You can now instruct Auto Loader to automatically move or delete files that have been processed. Opt in to this feature by using the cloudFiles.cleanSource Auto Loader option. See Auto Loader options, under cloudFiles.cleanSource.
Type widening support added for streaming from Delta tables: This release adds support for streaming from a Delta table that has type-widened column data, and for sharing a Delta table with type widening enabled using Databricks-to-Databricks Delta Sharing. The type widening feature is currently in Public Preview. See Type widening.
IDENTIFIER support now available in DBSQL for catalog operations: You can now use the IDENTIFIER clause when performing the following catalog operations:
- CREATE CATALOG
- DROP CATALOG
- COMMENT ON CATALOG
- ALTER CATALOG
This new syntax allows you to dynamically specify catalog names using parameters defined for these operations, enabling more flexible and reusable SQL workflows. As an example of the syntax, consider CREATE CATALOG IDENTIFIER(:param) where param is a parameter provided to specify a catalog name. See IDENTIFIER clause.
Collated expressions now provide autogenerated transient aliases: Autogenerated aliases for collated expressions now deterministically incorporate COLLATE information. Autogenerated aliases are transient (unstable) and should not be relied on. Instead, as a best practice, use expression AS alias consistently and explicitly.
Add filter pushdown API support to Python data sources: Serverless compute now supports filter pushdown to Python data source batch read as an a API, similar to SupportsPushDownFilters interface. See 16.4 LTS release notes.
Python UDF traceback improvement: The Python UDF traceback now includes frames from both the driver and executor along with client frames, resulting in better error messages that show greater and more relevant details (such as the line content of frames inside a UDF).
UNION/EXCEPT/INTERSECT inside a view and EXECUTE IMMEDIATE now return correct results: Queries for temporary and persistent view definitions with top-level UNION/EXCEPT/INTERSECT and un-aliased columns previously returned incorrect results because UNION/EXCEPT/INTERSECT keywords were considered aliases. Now those queries will correctly perform the whole set operation.
Data source cached plan conf and migration guide: Reading from a file source table will correctly respect query options (for example delimiters). Previously, the first query plan was cached and subsequent option changes ignored. To restore the previous behavior, set spark.sql.legacy.readFileSourceTableCacheIgnoreOptions to true.
New listagg and string_agg functions: Starting with this release you can use the listagg or string_agg functions to aggregate STRING and BINARY values within a group. See string_agg.

Performance mode is now configurable on serverless jobs

April 14, 2025

You can now select the performance mode of a serverless job using the Performance optimized setting in the job details page. Previously, all serverless jobs were performance optimized. Now, you can disable the Performance optimized setting to run the workload on standard performance mode. Standard peformance mode is designed to reduce costs on workloads where a slightly higher launch latency is acceptable.

Standard performance mode is not supported for continuous pipelines, one-time runs created using the runs/submit endpoint, or SQL warehouse job tasks, including materialized views.

For more on performance mode, see Select a performance mode.

Version 16.3

April 9, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.3.

Behavior changes

*Improved error message when kafka.sasl.client.callback.handler.class is assigned an invalid value: This release includes a change to return a more descriptive error message when kafka.sasl.client.callback.handler.class is assigned an invalid value.

New features

State reader support is GA: Support for reading state information for Structured Streaming queries is now generally available on serverless compute. See Read Structured Streaming state information.
Delta table protocol downgrade is GA with checkpoint protection: DROP FEATURE is generally available to remove Delta Lake table features and downgrade the table protocol. By default, DROP FEATURE now creates protected checkpoints for a more optimized and simplified downgrade experience that does not require any waiting time or history truncation. See Drop a Delta Lake table feature and downgrade table protocol.
Write procedural SQL scripts based on ANSI SQL/PSM (Public Preview): You can now use scripting capabilities based on ANSI SQL/PSM to write procedural logic with SQL, including control flow statements, local variables, and exception handling. See SQL scripting.
Table and view level default collation: You can now specify a default collation for tables and views. This simplifies the creation of table and views where all or most columns share the same collation. See Collation.
New H3 functions: Three new H3 functions have been added: h3_try_coverash3, h3_try_coverash3string, and h3_try_tessellateaswkb.
Alter multiple table columns in one ALTER TABLE statement: You can now alter multiple columns in a single ALTER TABLE statement. See ALTER TABLE ... COLUMN clause.

Version 16.2

March 13, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.2.

Behavior changes

In Delta Sharing, table history is enabled by default: Shares created using the SQL command ALTER SHARE <share> ADD TABLE <table> now have history sharing (WITH HISTORY) enabled by default. See ALTER SHARE.
Credential SQL statements return an error when there's a credential type mismatch: Now, if the credential type specified in a credential management SQL statement doesn't match the type of the credential argument, an error is returned and the statement is not run.

New features

Use the timestampdiff & timestampadd in generated column expressions You can now use the timestampdiff and timestampadd functions in Delta Lake generated column expressions. See Delta Lake generated columns.
Update to DESCRIBE TABLE returns metadata as structured JSON: You can now use the DESCRIBE TABLE AS JSON command to return table metadata as a JSON document. The JSON output is more structured than the default human-readable report and can be used to interpret a table's schema programmatically. To learn more, see DESCRIBE TABLE AS JSON.
Trailing blank insensitive collations: Serverless now supports trailing blank insensitive collations. For example, these collations treat 'Hello' and 'Hello ' as equal. To learn more, see RTRIM collation.

Bug fixes

Improved incremental clone processing: This release includes a fix for an edge case where an incremental CLONE might re-copy files already copied from a source table to a target table. See Clone a table on Azure Databricks.

High memory setting available on serverless notebooks (Public Preview)

February 7, 2025

You can now configure a higher memory size for your serverless compute notebook workloads. This setting can be applied to both interactive and scheduled notebook workloads.

Serverless usage with high memory has a higher DBU emission rate than standard memory.

For more information, see Use high memory serverless compute.

Version 16.1

February 5, 2025

This serverless compute release roughly corresponds to Databricks Runtime 16.0 and Databricks Runtime 16.1.

New features

Avro support for recursive schema: You can now use the recursiveFieldMaxDepth option with the from_avro function and the avro data source. This option sets the maximum depth for schema recursion on the Avro data source. See Read and write streaming Avro data.
Expanded support for Confluent Schema Registry for Avro: Serverless now supports Avro schema reference with the Confluent Schema Registry. See Authenticate to an external Confluent Schema Registry.
Force reclustering on tables with liquid clustering: You can now use the OPTIMIZE FULL syntax to force the reclustering of all records in a table with liquid clustering enabled. See Force reclustering for all records.
The Delta APIs for Python now support identity columns: You can now use the Delta APIs for Python to create tables with identity columns. See Use identity columns in Delta Lake.
Create liquid clustered tables during streaming writes: You can now use clusterBy to enable liquid clustering when creating new tables with Structured Streaming writes. See Enable liquid clustering.
Support for the OPTIMIZE FULL clause: Serverless compute now supports the OPTIMIZE FULL clause. This clause optimizes all records in a table that uses liquid clustering, including data that might have previously been clustered.
Support for WITH options specification in INSERT and table-reference: Serverless compute now supports an options specification for table references and table names of an INSERT statement which can be used to control the behavior of data sources.
New SQL functions: The following SQL functions are now available on serverless compute:
- try_url_decode is an error-tolerant version of url_decode.
- zeroifnull returns 0 if the input expression to the zeroifnull() function is NULL.
- nullifzero returns NULL if the input is 0 or its input if it is not 0.
- dayname(expr) returns the three-letter English acronym for the day of the week for the given date.
- uniform(expr1, expr2 [,seed]) returns a random value with independent and identically distributed values within the specified range of numbers.
- randstr(length) returns a random string of length alpha-numeric characters.
Enable automatic schema evolution when merging data into a Delta table: Support has been added for the withSchemaEvolution() member of the DeltaMergeBuilder class. Use withSchemaEvolution() to enable automatic schema evolution during MERGE operations. For example, mergeBuilder.whenMatched(...).withSchemaEvolution().execute()}}.
Support for collations in Apache Spark is in Public Preview: You can now assign language-aware, case-insensitive, and access-insensitive collations to STRING columns and expressions. These collations are used in string comparisons, sorting, grouping operations, and many string functions. See Collation.
Support for collations in Delta Lake is in Public Preview: You can now define collations for columns when creating or altering a Delta table. See Collation support for Delta Lake.
LITE mode for vacuum is in Public Preview: You can now use VACUUM table_name LITE to perform a lighter-weight vacuum operation that leverages metadata in the Delta transaction log. See Full versus lite mode and VACUUM.
Support for parameterizing the USE CATALOG with IDENTIFIER clause: The IDENTIFIER clause is now supported for the USE CATALOG statement. With this support, you can parameterize the current catalog based on a string variable or parameter marker.
COMMENT ON COLUMN support for tables and views: The COMMENT ON statement now supports altering comments for view and table columns.
Named parameter invocation for more functions: The following functions support named parameter invocation:
The SYNC METADATA parameter to the REPAIR TABLE command is supported with the Hive metastore: You can now use the SYNC METADATA parameter with the REPAIR TABLE command to update the metadata of a Hive metastore managed table. See REPAIR TABLE.
Enhanced data integrity for compressed Apache Arrow batches: To further protect against data corruption, every LZ4 compressed Arrow batch now includes the LZ4 content and block checksums. See LZ4 Frame Format Description.

Built-in Oracle JDBC Driver: Serverless compute now has the Oracle JDBC Driver built in. If you use a customer-uploaded JDBC driver JAR via DriverManager, you must rewrite scripts to explicitly use the custom JAR. Otherwise, the built-in driver is used. This driver only supports Lakehouse Federation. For other use cases, you must provide your own driver.
More detailed errors for Delta tables accessed with paths: A new error message experience for Delta tables accessed using paths is now available. All exceptions are now forwarded to the user. The exception DELTA_MISSING_DELTA_TABLE is now reserved for when underlying files cannot be read as a Delta table.

Behavior changes

Breaking change: Hosted RStudio is end-of-life: With this release, Databricks-hosted RStudio Server is end-of-life and unavailable on any Azure Databricks workspace running on serverless compute. To learn more and see a list of alternatives to RStudio, see Connect to a Databricks-hosted RStudio Server.

Breaking change: Removal of support for changing byte, short, int and long types to wider types: To ensure consistent behavior across Delta and Apache Iceberg tables, the following data type changes can no longer be applied to tables with the type widening feature enabled:
- byte, short, int and long to decimal.
- byte, short, and int to double.
Correct parsing of regex patterns with negation in nested character grouping: This release includes a change to support the correct parsing of regex patterns with negation in nested character grouping. For example, [^[abc]] will be parsed as “any character that is NOT one of 'abc'”.

Additionally, Photon behavior was inconsistent with Spark for nested character classes. Regex patterns containing nested character classes will no longer use Photon, and instead will use Spark. A nested character class is any pattern containing square brackets within square brackets, such as [[a-c][1-3]].
Improve duplicate match detection in Delta Lake MERGE: MERGE now considers conditions specified in the WHEN MATCHED clause. See Upsert into a Delta Lake table using merge.
The addArtifact() functionality is now consistent across compute types: When you use addArtifact(archive = True) to add a dependency to serverless compute, the archive is automatically unpacked.

Bug fixes

Timezone offsets now include seconds when serialized to CSV, JSON, and XML: Timestamps with timezone offsets that include seconds (common for timestamps from before 1900) were omitting the seconds when serialized to CSV, JSON, and XML. The default timestamp formatter has been fixed and now returns the correct offset values for these timestamps.

Other changes

Renamed error codes for the cloudFiles Structured Streaming source: The following error codes have been renamed:
- _LEGACY_ERROR_TEMP_DBR_0143 is renamed to CF_INCORRECT_STREAM_USAGE.
- _LEGACY_ERROR_TEMP_DBR_0260 is renamed to CF_INCORRECT_BATCH_USAGE .

Version 15.4

October 28, 2024

This serverless compute release roughly corresponds to Databricks Runtime 15.4

New features

UTF-8 validation functions: This release introduces the following functions for validating UTF-8 strings:
- is_valid_utf8 verified whether a string is a valid UTF-8 string.
- make_valid_utf8 converts a potentially invalid UTF-8 string to a valid UTF-8 string using substitution characters.
- validate_utf8 raises an error if the input is not a valid UTF-8 string.
- try_validate_utf8 returns NULL if the input is not a valid UTF-8 string.
Enable UniForm Iceberg using ALTER TABLE: You can now enable UniForm Iceberg on existing tables without rewriting data files. See Enable Iceberg reads on an existing table.
try_url_decode function: This release introduces the try_url_decode function, which decodes a URL-encoded string. If the string is not in the correct format, the function returns NULL instead of raising an error.
Optionally allow the optimizer to rely on unenforced foreign key constraints: To improve query performance, you can now specify the RELY keyword on FOREIGN KEY constraints when you CREATE or ALTER a table.
Parallelized job runs for selective overwrites: Selective overwrites using replaceWhere now run jobs that delete data and insert new data in parallel, improving query performance and cluster utilization.
Improved performance for change data feed with selective overwrites: Selective overwrites using replaceWhere on tables with change data feed no longer write separate change data files for inserted data. These operations use a hidden _change_type column present in the underlying Parquet data files to record changes without write amplification.
Improved query latency for the COPY INTO command: This release includes a change that improves the query latency for the COPY INTO command. This improvement is implemented by making the loading of state by the RocksDB state store asynchronous. With this change, you should see an improvement in start times for queries with large states, such as queries with a large number of already ingested files.
Support for dropping the check constraints table feature: You can now drop the checkConstraints table feature from a Delta table using ALTER TABLE table_name DROP FEATURE checkConstraints. See Disable check constraints.

Behavior changes

Schema binding change for views: When the data types in a view's underlying query change from those used when the view was first created, Databricks no longer throws errors for references to the view when no safe cast can be performed.

Instead, the view compensates by using regular casting rules where possible. This change allows Databricks to tolerate table schema changes more readily.
Disallow undocumented ! syntax toleration for NOT outside boolean logic: Databricks will no longer tolerate the use of ! as a synonym for NOT outside of boolean logic. This change reduces confusion, aligns with the SQL standard, and makes SQL more portable. For example:

CREATE ... IF ! EXISTS, IS ! NULL, ! NULL column or field property, ! IN and ! BETWEEN must be replaced with:

CREATE ... IF NOT EXISTS, IS NOT NULL, NOT NULL column or field property, NOT IN and NOT BETWEEN.

The boolean prefix operator ! (e.g. !is_mgr or !(true AND false)) is unaffected by this change.
Disallow undocumented and unprocessed portions of column definition syntax in views: Databricks supports CREATE VIEW with named columns and column comments.

The specification of column types, NOT NULL constraints, or DEFAULT has been tolerated in the syntax without having any effect. Databricks will remove this syntax toleration. Doing so reduces confusion, aligns with the SQL standard, and allows for future enhancements.
Consistent error handling for Base64 decoding in Spark and Photon: This release changes how Photon handles Base64 decoding errors to match the Spark handling of these errors. Before these changes, the Photon and Spark code generation path sometimes failed to raise parsing exceptions, while the Spark interpreted execution correctly raised IllegalArgumentException or ConversionInvalidInputError. This update ensures that Photon consistently raises the same exceptions as Spark during Base64 decoding errors, providing more predictable and reliable error handling.
Adding a CHECK constraint on an invalid column now returns the UNRESOLVED_COLUMN.WITH_SUGGESTION error class: To provide more useful error messaging, in Databricks Runtime 15.3 and above, an ALTER TABLE ADD CONSTRAINT statement that includes a CHECK constraint referencing an invalid column name returns the UNRESOLVED_COLUMN.WITH_SUGGESTION error class. Previously, an INTERNAL_ERROR was returned.

The JDK is upgraded from JDK 8 to JDK 17

August 15, 2024

Serverless compute for notebooks and workflows has migrated from Java Development Kit (JDK) 8 to JDK 17 on the server side. This upgrade includes the following behavioral changes:

Correct parsing of regex patterns with negation in nested character grouping: With this upgrade, Azure Databricks now supports the correct parsing of regex patterns with negation in nested character grouping. For example, [^[abc]] will be parsed as “any character that is NOT one of 'abc'”.

Additionally, Photon behavior was inconsistent with Spark for nested character classes. Regex patterns containing nested character classes will no longer use Photon, and instead will use Spark. A nested character class is any pattern containing square brackets within square brackets, such as [[a-c][1-3]].

Version 15.1

July 23, 2024

This serverless compute release roughly corresponds to Databricks Runtime 15.1

New features

Support for star (*) syntax in the WHERE clause: You can now use the star (*) syntax in the WHERE clause to reference all columns from the SELECT list.

For example, SELECT * FROM VALUES(1, 2) AS T(a1, a2) WHERE 1 IN(T.*).

Changes

Improved error recovery for JSON parsing: The JSON parser used for from_json() and JSON path expressions now recovers faster from malformed syntax, resulting in less data loss.

When encountering malformed JSON syntax in a struct field, an array value, a map key, or a map value, the JSON parser will now return NULL only for the unreadable field, key, or element. Subsequent fields, keys, or elements will be properly parsed. Prior to this change, the JSON parser abandoned parsing the array, struct, or map and returned NULL for the remaining content.

Version 14.3

April 15, 2024

This is the initial serverless compute version. This version roughly corresponds to Databricks Runtime 14.3 with some modifications that remove support for some non-serverless and legacy features.

Supported Spark configuration parameters

To automate the configuration of Spark on serverless compute, Azure Databricks has removed support for manually setting most Spark configurations. To view a list of supported Spark configuration parameters, see Configure Spark properties for serverless notebooks and jobs.

Job runs on serverless compute will fail if you set an unsupported Spark configuration.

input_file functions are deprecated

The input_file_name(), input_file_block_length(), and input_file_block_start() functions have been deprecated. Using these functions is highly discouraged.

Instead, use the file metadata column to retrieve file metadata information.

Behavioral changes

Serverless compute version 2024.15 includes the following behavioral changes:

unhex(hexStr) bug fix: When using the unhex(hexStr) function, hexStr is always padded left to a whole byte. Previously the unhex function ignored the first half-byte. For example: unhex('ABC') now produces x'0ABC' instead of x'BC'.
Auto-generated column aliases are now stable: When the result of an expression is referenced without a user-specified column alias, this auto-generated alias will now be stable. The new algorithm may result in a change to the previously auto-generated names used in features like materialized views.
Table scans with CHAR type fields are now always padded: Delta tables, certain JDBC tables, and external data sources store CHAR data in non-padded form. When reading, Azure Databricks will now pad the data with spaces to the declared length to ensure correct semantics.
Casts from BIGINT/DECIMAL to TIMESTAMP throw an exception for overflowed values: Azure Databricks allows casting from BIGINT and DECIMAL to TIMESTAMP by treating the value as the number of seconds from the Unix epoch. Previously, Azure Databricks would return overflowed values but now throws an exception in cases of overflow. Use try_cast to return NULL instead of an exception.
PySpark UDF execution has been improved to match the exact behavior of UDF execution on dedicated compute: The following changes have been made:
- UDFs with a string return type no longer implicitly convert non-string values into strings. Previously, UDFs with a return type of str would apply a str(..) wrapper to the result regardless of the actual data type of the returned value.
- UDFs with timestamp return types no longer implicitly apply a timezone conversion to timestamps.

Tilbakemeldinger

Var denne siden nyttig?

Last updated on 2026-03-20

Del via

Serverless compute release notes

Serverless environment versions

Release notes

Serverless version 18.0

Version 18.0

New features

Behavior changes

Serverless environment version 5 is now available

Version 17.3

New features

Behavior changes

Version 17.2

New features

Behavior changes

Version 17.1

New features

Behavior changes

Serverless environment version 4

Version 17.0

New features

Behavior changes

Serverless performance targets is GA

Version 16.4

Behavior changes

New features

Performance mode is now configurable on serverless jobs

Version 16.3

Behavior changes

New features

Version 16.2

Behavior changes

New features

Bug fixes

High memory setting available on serverless notebooks (Public Preview)

Version 16.1

New features

Behavior changes

Bug fixes

Other changes

Version 15.4

New features

Behavior changes

The JDK is upgraded from JDK 8 to JDK 17

Version 15.1

New features

Changes

Version 14.3

Supported Spark configuration parameters

input_file functions are deprecated

Behavioral changes

Tilbakemeldinger

Flere ressurser