I used "path//.parquet" in the "Path" field and now it works.
How can I create a dataset in Azure ML Studio (through the GUI) from Parquet files created with Azure Spark?
Nastasia Saby
I'm trying to load files as a dataset in the GUI of Azure ML Studio. These parquet files have been created through Spark.
In my folder, Spark creates files such as "_SUCCESS" or "_committed_8998000".
Azure ML Studio can neither read these files nor ignore them, and it reports:
The provided file(s) have invalid byte(s) for the specified file encoding.
{
"message": " "
}
I selected "Ignore unmatched files path", yet it still does not work.
If I remove the "_SUCCESS" and other Spark files, it works.
Does anyone have an idea about a workaround?
Thank you.
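For illustration, here is a minimal Python sketch of why a path pattern restricted to `.parquet` files would skip Spark's marker files. The folder listing and the `select_parquet` helper are hypothetical, modeled on the file names mentioned above, and the sketch does not claim that the "Path" field in Azure ML Studio uses exactly the same matching rules.

```python
from fnmatch import fnmatchcase

# Hypothetical folder listing: the marker names come from the question,
# the part-file names are assumed for the example.
files = [
    "_SUCCESS",
    "_committed_8998000",
    "part-00000.snappy.parquet",
    "part-00001.snappy.parquet",
]


def select_parquet(names, pattern="*.parquet"):
    """Return only the names that match the glob pattern (case-sensitive)."""
    return [name for name in names if fnmatchcase(name, pattern)]


data_files = select_parquet(files)
print(data_files)
```

Only the two part files match the pattern, so the marker files never reach the Parquet reader. The same idea applies when a dataset path ends in a `.parquet` wildcard instead of pointing at the whole folder.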