ETL - Automate Data validation

Santhi Dhanuskodi 265 Reputation points
2024-09-24T09:46:31.7966667+00:00

Hi,

I am looking for suggestions on tools and technologies that can be used to validate destination data after ETL.

My source is CSV/Excel files in a storage container or on a local machine, and the destination is an Azure SQL DB table.

I have more than 1,500 source files for one target table, so I want to automate validation of the destination.

I want to compare the total row count, average, and sum of columns, and spot-check random values in a column, between the source files and the destination table.

I am looking for tools that can be implemented in a short time.

Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

1 answer

  1. Nandan Hegde 32,256 Reputation points MVP
    2024-09-24T10:28:50.6066667+00:00

    You can use Great Expectations, an open-source tool for data validation:

    https://greatexpectations.io/expectations

    This can be triggered from Databricks, Azure Functions, etc.
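
Whichever tool is chosen, the checks named in the question (row count, sum, average, and a random spot check) can be sketched in plain pandas. This is only a minimal sketch and does not use the Great Expectations API. The function names, the `id` and `amount` columns, and the sample data are hypothetical. It assumes the destination figures come from a separate query such as `SELECT COUNT(*), SUM(amount), AVG(amount)` against the Azure SQL table, and that a slice of the destination has been loaded into a DataFrame for the spot check. Neither step is shown here.

```python
import io
import math

import pandas as pd


def source_stats(csv_sources, column):
    """Accumulate row count, sum, and average of `column` across many CSV sources."""
    total_rows = 0
    non_null = 0
    total_sum = 0.0
    for src in csv_sources:
        df = pd.read_csv(src, usecols=[column])
        total_rows += len(df)                 # all rows, like COUNT(*)
        non_null += int(df[column].count())   # non-null values only
        total_sum += float(df[column].sum())  # NaN values are skipped
    # Divide by the non-null count so the result matches SQL AVG, which ignores NULLs.
    avg = total_sum / non_null if non_null else float("nan")
    return {"count": total_rows, "sum": total_sum, "avg": avg}


def compare_stats(source, destination, rel_tol=1e-9):
    """Return {metric: (source_value, destination_value)} for every mismatch."""
    mismatches = {}
    for metric in ("count", "sum", "avg"):
        s, d = source[metric], destination[metric]
        if not math.isclose(s, d, rel_tol=rel_tol):
            mismatches[metric] = (s, d)
    return mismatches


def spot_check(source_df, dest_df, key, column, n=5, seed=0):
    """Sample up to n source rows and return the keys whose value differs
    in the destination (keys missing from the destination are also returned)."""
    sample = source_df.sample(n=min(n, len(source_df)), random_state=seed)
    merged = sample.merge(
        dest_df[[key, column]], on=key, how="left", suffixes=("_src", "_dst")
    )
    bad = merged[merged[f"{column}_src"] != merged[f"{column}_dst"]]
    return bad[key].tolist()


# Hypothetical in-memory stand-ins for two of the source files.
file_a = "id,amount\n1,10.0\n2,20.0\n"
file_b = "id,amount\n3,30.0\n4,\n"
src = source_stats([io.StringIO(file_a), io.StringIO(file_b)], "amount")

# Hypothetical destination aggregates, as a SQL query might return them.
dest = {"count": 4, "sum": 60.0, "avg": 20.0}
mismatches = compare_stats(src, dest)

# Spot check against a destination slice in which one value was altered.
source_df = pd.DataFrame({"id": [1, 2, 3], "amount": [10.0, 20.0, 30.0]})
dest_df = pd.DataFrame({"id": [1, 2, 3], "amount": [10.0, 25.0, 30.0]})
bad_keys = spot_check(source_df, dest_df, key="id", column="amount", n=3)

print(mismatches, bad_keys)
```

With 1,500+ files, passing the file paths (or blob streams) to `source_stats` one at a time keeps memory use bounded, because only one file is loaded at once. Note that the spot check also flags rows whose source value is null, since a null never compares equal to anything.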

