Azure Databricks Automatic Schema Evolution

Harshit Chandani 40 Reputation points
2024-02-27T10:23:22.34+00:00

Definition: I've created a Delta Live Tables pipeline to handle real-time data.

Use Case: I want my Delta Live Tables pipeline to handle schema changes whenever data arrives with a different schema. Example:

Athletes.csv schema: Name, Country
Athletes-1.csv schema: Name, Country, Sport

At first, DLT receives the Athletes.csv file and then Athletes-1.csv, but Athletes-1.csv has a different schema than Athletes.csv. How can this situation be handled?
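To make concrete what schema evolution has to do in this example, here is a minimal plain-Python sketch (the column lists are taken from the Athletes.csv / Athletes-1.csv example above; the helper name is illustrative, not a Databricks API): the evolved schema is the union of the old and new columns, with rows written under the old schema reading as null for the added column.

```python
# Minimal sketch of schema merging: the evolved schema is the union of
# the old and new column lists, preserving first-seen order.

def merge_schemas(old_columns, new_columns):
    """Return the union of two column lists, keeping first-seen order."""
    merged = list(old_columns)
    for col in new_columns:
        if col not in merged:
            merged.append(col)
    return merged

athletes_schema = ["Name", "Country"]            # Athletes.csv
athletes1_schema = ["Name", "Country", "Sport"]  # Athletes-1.csv

print(merge_schemas(athletes_schema, athletes1_schema))
# ['Name', 'Country', 'Sport']
```

This is what Delta's automatic schema merging does at the metadata level when it accepts the wider schema of the incoming file.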

Implemented Solution: I've set the Spark configuration:

spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", "true")

Error:

com.databricks.sql.transaction.tahoe.DeltaAnalysisException: Unknown configuration was specified: delta.schema.autoMerge.enabled

Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.

1 answer

Sort by: Most helpful
  1. Vinodh247 17,941 Reputation points
    2024-02-27T14:21:52.1933333+00:00

    Hi Harshit Chandani,

    Thanks for reaching out to Microsoft Q&A.

    This error suggests that the delta.schema.autoMerge.enabled configuration setting is not recognized.

    • First, try spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", True) or spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", "True") instead of spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", "true").
    • Check for typos or extra spaces in the configuration parameter name.
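As an additional sketch (an assumption, not part of the answer above): in Delta Lake, schema merging can also be enabled per write with the mergeSchema option rather than a session-wide conf. The snippet below assumes a plain Databricks notebook (not inside a DLT pipeline definition) and a hypothetical path /mnt/athletes; it is not runnable outside a Spark-with-Delta environment.

```python
# Sketch only: assumes a Spark session with Delta Lake available and a
# hypothetical storage path /mnt/athletes. Paths are illustrative.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Option 1: enable automatic schema merging for the whole session.
spark.conf.set("spark.databricks.delta.schema.autoMerge.enabled", "true")

# Option 2: enable it for a single write with the mergeSchema option,
# so the extra 'Sport' column in Athletes-1.csv is added to the table.
df = spark.read.option("header", "true").csv("/mnt/athletes/Athletes-1.csv")
(df.write
   .format("delta")
   .option("mergeSchema", "true")
   .mode("append")
   .save("/mnt/athletes/delta"))
```

The per-write mergeSchema option is often the safer choice, since it widens the schema only for writes where evolution is expected instead of for every operation in the session.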


    Please 'Upvote'(Thumbs-up) and 'Accept' as an answer if the reply was helpful. This will benefit other community members who face the same issue.

