Data lake table updates are slow
Abhishek Gaikwad
We have a data lake table with billions of records stored in Parquet format. Any SQL query we run against this table is slow.
We are also facing an issue when updating certain columns to null values in this table: the updates take a long time. However, when we copy the table to a new Parquet file and run the same update against the new file, the updates are faster. Can you please confirm why the updates are faster against the copied file and take so long against the original file?
Azure Data Lake Storage