Incremental Dynamic Historical Data Loading by Year and Month: From SQL On-Premise to Azure Blob Storage
I Would like to extract data from multiple SQL Server On-premises tables to Azure Blob Storage, focusing on a historical load of data dated before 2024-01-15. For each table, the data should be stored in dynamically created folders, organized by year and month, and following the structure 'year/month/tablename.parquet'. This process should be based on the watermark column instead of using utcNow()
This is my currently pipeline. I am using incremental loading using a watermark table:
inside foreach:
Lookup to get old value:
Lookup to get max value:
Copy activity Source query:
SINK:
Has anyone implemented a similar solution, or can you suggest an approach to achieve this?