While ADLS Gen2 is schema-on-read, it helps to organize your data into folders that mirror the star schema. For example:
- /fact_sales/
- /dim_customers/
- /dim_products/
- ...
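As a rough illustration of the layout above, here is a minimal Python sketch that maps each star-schema table to its folder URL in the lake. The storage account name `mydatalake` and the container name `warehouse` are placeholders I made up for the example, so substitute your own.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LakeLayout:
    """Describes where the star-schema folders live in ADLS Gen2."""

    account: str    # storage account name (hypothetical in this example)
    container: str  # container / file system name (hypothetical)

    def folder_url(self, table: str) -> str:
        # HTTPS endpoint form of an ADLS Gen2 path, one folder per table.
        return (
            f"https://{self.account}.dfs.core.windows.net/"
            f"{self.container}/{table}/"
        )


# One folder per fact or dimension table, as in the list above.
STAR_TABLES = ["fact_sales", "dim_customers", "dim_products"]

layout = LakeLayout(account="mydatalake", container="warehouse")
table_urls = {table: layout.folder_url(table) for table in STAR_TABLES}

for name, url in table_urls.items():
    print(f"{name}: {url}")
```

Keeping one folder per table means each fact or dimension can be addressed with a single path or wildcard later on.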
In your case, you can use Azure Synapse Analytics to query and analyze the data in ADLS Gen2. Make sure you have an Azure Synapse workspace set up and that it can access the storage account.
Synapse integrates natively with ADLS Gen2. You just need to create a linked service for the storage account and datasets for your data.
Using either a serverless SQL pool or a dedicated (provisioned) SQL pool, you can write and run T-SQL queries against your data in ADLS, treating the folders as if they were tables in a traditional star schema.
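To give an idea of what such a query could look like on a serverless SQL pool, here is a small Python sketch that assembles a T-SQL statement joining the fact folder to one dimension folder through `OPENROWSET`. It assumes the files are stored as Parquet, and the base URL and the column names (`customer_id`, `customer_name`, `amount`) are hypothetical, so adjust them to your data.

```python
# Hypothetical lake location; replace with your storage account and container.
BASE_URL = "https://mydatalake.dfs.core.windows.net/warehouse"


def parquet_source(table: str, alias: str) -> str:
    """Return an OPENROWSET clause that reads every Parquet file in a table folder."""
    return (
        "OPENROWSET(\n"
        f"        BULK '{BASE_URL}/{table}/*.parquet',\n"
        "        FORMAT = 'PARQUET'\n"
        f"    ) AS {alias}"
    )


def build_sales_by_customer_query() -> str:
    """Build a star-schema style join: fact_sales joined to dim_customers.

    The column names are assumptions made for illustration only.
    """
    return (
        "SELECT c.customer_name, SUM(f.amount) AS total_sales\n"
        f"FROM {parquet_source('fact_sales', 'f')}\n"
        f"JOIN {parquet_source('dim_customers', 'c')}\n"
        "    ON f.customer_id = c.customer_id\n"
        "GROUP BY c.customer_name;"
    )


query = build_sales_by_customer_query()
print(query)
```

The printed statement can then be pasted into a SQL script in Synapse Studio and run against the serverless pool. The script only builds text, so nothing is sent to Azure until you run the query yourself.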
Here are some links that can help:
- Azure Data Lake Storage Gen2: ADLS Gen2 Documentation
- Azure Synapse Analytics: Azure Synapse Analytics Documentation
- Serverless SQL pool: Using Serverless SQL pool with ADLS Gen2