SQL Server schema drift - source columns keep changing

RJ 366 Reputation points
2025-01-27T23:20:10.3466667+00:00

Hi there,

I have built a simple ADF framework for an ETL pipeline.

It contains a control table that lists the source tables and their source queries (pic attached). Using a Lookup activity, I loop through the table names, truncate the existing data in the target tables, and load/auto-create the target tables using a Copy Data activity.

```sql
select 1 'ID', 'Source1' SourceSystem, 'dbo' SourceSchema, 'Table1' SourceTableName, 'Select * from Table1' SourceQuery, 'Staging' TargetSchema, 'Table1' TargetTableName
union
select 2 'ID', 'Source1' SourceSystem, 'dbo' SourceSchema, 'Table2' SourceTableName, 'Select * from Table2' SourceQuery, 'Staging' TargetSchema, 'Table2' TargetTableName
union
select 3 'ID', 'Source1' SourceSystem, 'dbo' SourceSchema, 'Table3' SourceTableName, 'Select * from Table3' SourceQuery, 'Staging' TargetSchema, 'Table3' TargetTableName
```


The source systems keep adding new columns to, and sometimes dropping columns from, the source SQL Server tables.

The failure happens on the 'Sink' side:

```
ErrorCode=UserErrorInvalidColumnName,'Type=Microsoft.DataTransfer.Common.Shared.HybridDeliveryException,Message=The column credit_used is not found in target side,Source=Microsoft.DataTransfer.ClientLibrary,'
```

Ideally I don't want to drop hundreds of tables (currently I'm only truncating).

Is there a way to keep the copy working through schema changes without errors - SQL Server source tables to Azure SQL Server destination tables, even if the structure changes? Any examples you could refer me to?

Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

1 answer

  1. Vinodh247 40,221 Reputation points MVP Volunteer Moderator
    2025-01-28T05:45:32.6266667+00:00

    Hi,

    Thanks for reaching out to Microsoft Q&A.

    Solution 1:

    Use Auto-Create Tables with Flexible Schema Copy

    1. Enable "Auto Create" in the Copy Activity
      • In your Copy Activity, under the Sink settings, enable:
      • Auto create table: creates missing target tables automatically.
      • Allow schema drift: allows changes in the schema.
    2. Enable "Skip Incompatible Columns"
      • Under Mapping, set the Skip incompatible columns option.
      • This prevents errors caused by missing or extra columns.
    3. Full Load (Truncate & Load) Strategy
      • If you truncate and reload, the new schema is picked up automatically on reload.
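    One caveat on the truncate-and-reload step: TRUNCATE keeps the table's existing structure, so auto-create only picks up a new schema when the target table does not exist yet. A minimal pre-copy-script sketch for the sink (table names here are illustrative, taken from the question's control table) is a guarded DROP instead of a TRUNCATE:

    ```sql
    -- Illustrative pre-copy script for the Copy activity sink.
    -- Dropping (rather than truncating) lets "Auto create table"
    -- recreate Staging.Table1 with the current source schema.
    IF OBJECT_ID(N'Staging.Table1', N'U') IS NOT NULL
        DROP TABLE Staging.Table1;
    ```

    In the pipeline this script would be parameterized from the control table's TargetSchema/TargetTableName columns rather than hard-coded.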

    Solution 2:

    Use Mapping Data Flows with "Allow Schema Drift"

    • Use a Mapping Data Flow instead of Copy Activity
      • Add a Source transformation and enable Schema Drift to capture all columns dynamically.
      • Use a Sink with Allow Schema Drift enabled, ensuring new columns flow without failures.

    Solution 3:

    Use "Stage and Merge" Strategy

    Load into a Staging Table (with a dynamic structure)

    • Instead of loading directly, copy into a wide, flexible staging table.
      • The staging table can use a JSON or XML column to hold unexpected columns.
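    As a minimal sketch of that idea (table and column names are illustrative), the staging table keeps the stable key columns typed and pushes everything else into a single JSON column:

    ```sql
    -- Hypothetical staging table: known/stable columns are typed,
    -- drifting columns are captured in one JSON payload column.
    CREATE TABLE Staging.Table1_Flex
    (
        ID           INT           NOT NULL,
        LoadDate     DATETIME2     NOT NULL DEFAULT SYSUTCDATETIME(),
        ExtraColumns NVARCHAR(MAX) NULL  -- JSON bag for unexpected columns
            CONSTRAINT CK_Table1_Flex_Json CHECK (ISJSON(ExtraColumns) = 1)
    );

    -- Drifted values (e.g. the credit_used column from the error message)
    -- can later be projected out with OPENJSON:
    SELECT s.ID,
           j.credit_used
    FROM   Staging.Table1_Flex AS s
    CROSS APPLY OPENJSON(s.ExtraColumns)
         WITH (credit_used DECIMAL(18, 2)) AS j;
    ```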

    Use MERGE with Dynamic SQL

    • Load data into the main table after validating the schema dynamically.
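    A hedged sketch of that validation step (object names are placeholders based on the question): compare sys.columns on both sides, ALTER the target to add any columns that are new on the source side, and only then run the generated MERGE or INSERT:

    ```sql
    -- Illustrative dynamic-SQL pattern: add columns that exist in the
    -- staging table but not yet in the target, so the load no longer
    -- fails on a new source column such as credit_used.
    DECLARE @sql NVARCHAR(MAX) = N'';

    SELECT @sql += N'ALTER TABLE dbo.Table1 ADD '
                 + QUOTENAME(c.name) + N' ' + t.name
                 + CASE
                       WHEN t.name IN (N'varchar', N'nvarchar', N'varbinary') THEN
                           N'(' + CASE WHEN c.max_length = -1 THEN N'MAX'
                                       WHEN t.name = N'nvarchar' THEN CAST(c.max_length / 2 AS NVARCHAR(10))
                                       ELSE CAST(c.max_length AS NVARCHAR(10)) END + N')'
                       ELSE N''
                   END
                 + N' NULL;' + CHAR(13)
    FROM   sys.columns AS c
    JOIN   sys.types   AS t ON t.user_type_id = c.user_type_id
    WHERE  c.object_id = OBJECT_ID(N'Staging.Table1')
    AND    NOT EXISTS (SELECT 1
                       FROM   sys.columns AS tc
                       WHERE  tc.object_id = OBJECT_ID(N'dbo.Table1')
                       AND    tc.name = c.name);

    EXEC sys.sp_executesql @sql;
    -- Once the schemas match, a generated MERGE (or INSERT ... SELECT)
    -- from Staging.Table1 into dbo.Table1 can run safely.
    ```

    Dropped source columns simply stay NULL on the target with this approach, which is usually preferable to deleting them.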

    Please feel free to click the 'Upvote' (Thumbs-up) button and 'Accept as Answer'. This helps the community by allowing others with similar queries to easily find the solution.

