Can I use compressed data on TabularDataset?

Makoto Oda 291 Reputation points
2021-11-30T01:02:30.553+00:00

I have a question about the data sources supported by TabularDataset in Azure Machine Learning.

Can I use compressed data stored in Azure Data Lake Storage Gen2, such as the formats below, with TabularDataset without decompressing it first?

  • CSV with bzip2 (.bz2)
  • Parquet with gzip (.gz)
  • Parquet with Snappy
Azure Data Lake Storage
Azure Machine Learning

Accepted answer
  1. GiftA-MSFT 11,151 Reputation points
    2021-11-30T02:15:20.327+00:00

    Hi, a tabular dataset (TabularDataset) does not support compressed files. You'll need to extract the data, as shown here for example, before creating a tabular dataset. A file dataset (FileDataset), however, supports any file format.
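
A minimal sketch of the extract-then-create approach described in this answer, assuming the v1 `azureml-core` SDK and a bzip2-compressed CSV. The helper names `decompress_bz2` and `tabular_from_extracted` are hypothetical, and the sketch assumes the extracted files have already been copied back to a registered datastore; neither detail comes from the original answer.

```python
import bz2
import shutil
from pathlib import Path


def decompress_bz2(src: Path, dst_dir: Path) -> Path:
    """Decompress a .bz2 file into dst_dir and return the extracted file's path."""
    dst_dir.mkdir(parents=True, exist_ok=True)
    # "data.csv.bz2" -> "data.csv" (Path.stem drops only the last suffix)
    dst = dst_dir / src.stem
    with bz2.open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)  # streams in chunks, so large files are fine
    return dst


def tabular_from_extracted(datastore, remote_glob: str):
    """Build a TabularDataset from already-extracted CSV files on a datastore.

    Assumption: `datastore` is a registered azureml Datastore and `remote_glob`
    (for example "extracted/*.csv") points at uncompressed files.
    """
    # Imported lazily so decompress_bz2 can be used without the SDK installed.
    from azureml.core import Dataset

    return Dataset.Tabular.from_delimited_files(path=[(datastore, remote_glob)])
```

For example, `decompress_bz2(Path("data.csv.bz2"), Path("extracted"))` would write `extracted/data.csv`, which can then be uploaded to the datastore and passed to `tabular_from_extracted`.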


    --- *Kindly Accept Answer if the information helps. Thanks.*

