Azure ML Designer is not able to read dataset that is a csv file in a blob storage

Chandan Agrawal 0 Reputation points
2024-11-28T11:03:21.2333333+00:00

Hi,

I have a pipeline in AzureML designer that reads data from a blob storage and then does a very step e.g. Clean Missing Data.

The data is read in the first step without any issues. I can 'Preview data' also.

But I get the following error as soon as I try to read this data in the subsequent 'Clean Missing data' step.

I dont get this kind of error if I try the above with a 'Sample data' within the designer. I dont get this error when reading the same data using Python in the 'Notebooks'.

I have tried googling a lot but cannot find the setting that the error is talking about.

Somebody please help.

AzureMLException:   Message: Error Code: ScriptExecution.StreamAccess.Unexpected Native Error: error in streaming from input data sources   StreamError(Unknown("Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.", Some(DataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer", expected: "delimited|parquet|preppy", actual: "dfd" }))))))) => Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.   Unknown("Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.", Some(DataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer", expected: "delimited|parquet|preppy", actual: "dfd" })))))) => Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.   DataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer", expected: "delimited|parquet|preppy", actual: "dfd" })))) Error Message: Got unexpected error: Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.. DataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer", expected: "delimited|parquet|preppy", actual: "dfd" }))))| session_id=63f50d89-fe91-4d84-850b-66476d80d200   InnerException None   ErrorResponse {     "error": {         "message": "\nError Code: ScriptExecution.StreamAccess.Unexpected\nNative Error: error in streaming from input data sources\n\tStreamError(Unknown("Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.", Some(DataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer", expected: "delimited|parquet|preppy", actual: "dfd" })))))))\n=> Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.\n\tUnknown("Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.", Some(DataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer", expected: "delimited|parquet|preppy", actual: "dfd" }))))))\n=> Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.\n\tDataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer", expected: "delimited|parquet|preppy", actual: "dfd" }))))\nError Message: Got unexpected error: Dataflow at inmemory://dataflow/787945681c12f518bad542fa0ed9e555 is not valid.. DataflowInvalid("inmemory://dataflow/787945681c12f518bad542fa0ed9e555", VisitError(ExecutionError(ArgumentError(InvalidArgument { argument: "writer"

User's image

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
3,000 questions
Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,968 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.