Why are .csv files not directly supported with AutoML?

Brian Van Steen 20 Reputation points
2024-05-14T14:35:58.5866667+00:00

Hello,
For Azure Automated ML, I wanted to follow a tutorial that used the Microsoft bike rentals dataset. In the tutorial, the .csv file was uploaded directly, and the Automated ML pipeline worked fine.
When I tried using my uploaded .csv file, the data asset was flagged as not supported.
I eventually figured out how to modify the .csv file to create an MLTable object with the requisite YAML file, and to upload the folder with the .csv and .yaml files.

Is there any way to use a .csv file directly?

Azure Machine Learning

Accepted answer
  1. santoshkc 5,165 Reputation points Microsoft Vendor
    2024-05-15T07:52:49.92+00:00

    Hi @Brian Van Steen,

    Thank you for reaching out to Microsoft Q&A forum!

    Azure Automated ML supports various data formats, including CSV. However, it is possible that the CSV file you uploaded was not in the correct format or structure required by Automated ML.

    I tried to reproduce your issue and was able to use .csv files by selecting the data "type" as "Tabular".

    To use a CSV file directly, ensure that it meets the following requirements:

    1. The first row of the CSV file should contain the column headers.
    2. The CSV file should not contain any blank rows or columns.
    3. The CSV file should not contain any special characters or unsupported data types.
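A quick local check for the first two requirements can be sketched in Python before uploading. This is only an illustration using the standard library: the function name `check_csv` and the exact rules (non-empty header names, no blank records, no column that is empty in every record) are assumptions made for this sketch, not the validation that Automated ML itself performs.

```python
import csv
import io


def check_csv(text: str) -> list[str]:
    """Return a list of problems found; an empty list means none were detected."""
    problems = []
    rows = list(csv.reader(io.StringIO(text)))
    if not rows:
        return ["file is empty"]

    header = rows[0]
    # Requirement 1: the first row should name every column.
    if any(not name.strip() for name in header):
        problems.append("header row has an empty column name")

    for record_no, row in enumerate(rows[1:], start=2):
        # Requirement 2: no blank rows (csv.reader yields [] for an empty line).
        if not any(cell.strip() for cell in row):
            problems.append(f"record {record_no} is blank")
        elif len(row) != len(header):
            problems.append(
                f"record {record_no} has {len(row)} fields, expected {len(header)}"
            )

    # Requirement 2: no column that is empty in every well-formed record.
    data_rows = [r for r in rows[1:] if len(r) == len(header)]
    for idx, name in enumerate(header):
        if data_rows and all(not r[idx].strip() for r in data_rows):
            problems.append(f"column {name!r} is entirely blank")
    return problems
```

To check a file on disk, something like `check_csv(open("bike-rentals.csv", encoding="utf-8", newline="").read())` could be used, where the file name is a placeholder.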

    If your CSV file meets these requirements and is still not supported by Automated ML, you can try converting it to a supported format like Parquet or TSV.
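If you want to try the TSV route, the conversion can be done with Python's standard `csv` module. This is a minimal sketch; the helper name `csv_to_tsv` is made up for illustration. Converting to Parquet would need a third-party library (for example pandas with pyarrow), which is not shown here.

```python
import csv
import io


def csv_to_tsv(csv_text: str) -> str:
    """Re-encode comma-separated text as tab-separated text."""
    reader = csv.reader(io.StringIO(csv_text, newline=""))
    out = io.StringIO(newline="")
    # Quoted commas in the input become plain commas inside a tab-delimited field.
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerows(reader)
    return out.getvalue()
```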

    Alternatively, you can create an MLTable object with the requisite YAML file, as you did before, and upload the folder with the CSV and YAML files. This approach allows you to customize the data preprocessing steps and transformations before training the model.
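For reference, the MLTable folder layout described above can be generated with a few lines of Python. This is a sketch under assumptions: the YAML keys (`type`, `paths`, `transformations`, `read_delimited`) and the schema URL follow the MLTable format as I understand it from the Azure documentation, so please verify them against the current docs; the function names and the CSV file name are hypothetical.

```python
from pathlib import Path

# Assumed MLTable definition for a single comma-delimited file with a header row.
MLTABLE_TEMPLATE = """\
$schema: https://azuremlschemas.azureedge.net/latest/MLTable.schema.json
type: mltable
paths:
  - file: ./{csv_name}
transformations:
  - read_delimited:
      delimiter: ','
      encoding: utf8
      header: all_files_same_headers
"""


def render_mltable(csv_name: str) -> str:
    """Return the MLTable YAML text pointing at csv_name in the same folder."""
    return MLTABLE_TEMPLATE.format(csv_name=csv_name)


def write_mltable(folder: Path, csv_name: str) -> Path:
    """Write the definition to a file named 'MLTable' (no extension) next to the CSV."""
    folder.mkdir(parents=True, exist_ok=True)
    target = folder / "MLTable"
    target.write_text(render_mltable(csv_name), encoding="utf-8")
    return target
```

Calling `write_mltable(Path("bike-data"), "bike-rentals.csv")` would produce a `bike-data` folder definition that can be uploaded together with the CSV as an MLTable data asset.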

    For more information, refer to the Azure documentation on "Create and manage data assets" and on how to create an MLTable object.

    I hope this helps! Thank you.
