Error while running Azure AutoML job

Praveen Kumar Rajendran 0 Reputation points
2023-12-12T12:46:20.7366667+00:00

I am trying to run a job with a CSV file having 51 rows and 4500+ columns. However, I am encountering the following validation error:

Validation error while fetching data. Error: 
Error Code: ScriptExecution.StreamAccess.Validation
Validation Error Code: InvalidEncoding
Validation Target: TextFile
Failed Step: 4bcb98bc-077b-4d9e-a31c-2cfeed81999b
Error Message: ScriptExecutionException was caused by StreamAccessException.
  StreamAccessException was caused by ValidationException.
    Unable to read file using Unicode (UTF-8). Attempted read range 0:266363. Lines read in the range 44. Decoding error: Unable to translate bytes [FC] at index 371 from specified code page to Unicode.
      Unable to translate bytes [FC] at index 371 from specified code page to Unicode.
| session_id=2820d615-61ca-49c0-9c86-a03a98f2115c

I need suggestions on how to resolve this issue.

Azure Machine Learning
An Azure machine learning service for building and deploying models.

1 answer

  1. VasaviLankipalle-MSFT 15,941 Reputation points
    2023-12-12T21:47:34.1666667+00:00

    Hello @Praveen Kumar Rajendran, thanks for using the Microsoft Q&A platform.

    It looks like some rows in the dataset cannot be parsed because the file contains bytes that are not valid UTF-8: the error reports byte FC at index 371, so the file was probably saved in a different encoding. You may also want to split the CSV file into smaller files and process them separately. This reduces the size of each file and makes it easier to find the rows that fail.

    Could you share the code you are using to read the CSV file?

    I hope this helps.

    Regards,
    Vasavi
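
As a follow-up to the encoding point in the answer above: the byte FC reported in the error can never appear in valid UTF-8, while in Windows-1252 and Latin-1 it is the character "ü". One possible fix, assuming the CSV was exported in one of those encodings (the thread does not confirm this), is to re-save the file as UTF-8 before uploading it. The sketch below is only an illustration: the function names, the candidate encoding list, and the file names are hypothetical and not part of Azure Machine Learning.

```python
from pathlib import Path

# Candidate encodings to try, in order. This list is an assumption; adjust it
# for your data. latin-1 accepts any byte sequence, so it must come last.
CANDIDATE_ENCODINGS = ("utf-8", "cp1252", "latin-1")


def find_invalid_utf8(raw: bytes):
    """Return (byte_offset, byte_value) of the first invalid UTF-8 byte, or None."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError as exc:
        return exc.start, raw[exc.start]
    return None


def convert_to_utf8(src: str, dst: str, encodings=CANDIDATE_ENCODINGS) -> str:
    """Decode src with the first encoding that works and rewrite it as UTF-8.

    Returns the name of the encoding that was used to decode the source file.
    """
    raw = Path(src).read_bytes()
    for enc in encodings:
        try:
            text = raw.decode(enc)
        except UnicodeDecodeError:
            continue
        # Write bytes directly so that line endings are preserved as-is.
        Path(dst).write_bytes(text.encode("utf-8"))
        return enc
    raise ValueError(f"none of {encodings} could decode {src}")
```

For example, `convert_to_utf8("data.csv", "data_utf8.csv")` (both file names are placeholders) writes a UTF-8 copy that can then be used for the AutoML job. Because a wrong guess still decodes without an error in many cases, it is worth opening the converted file and checking that accented characters look right.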
