TabularDataset "topandasdataframe()" - does not support pandas errorbadlines?

Sriram Narayanan 21 Reputation points
2020-11-17T04:42:49.59+00:00

Hello,

I am trying to skip lines that produces more columns than intended while loading to a pandas dataframe.

Like this Pandas Option: When error_bad_lines = False, pandas will skip these lines.

40293-pictravel.jpg

How can I achieve this with to-pandas-dataframe? Thanks.

https://learn.microsoft.com/en-us/python/api/azureml-core/azureml.data.tabulardataset?view=azure-ml-py#to-pandas-dataframe-on-error--null---out-of-range-datetime--null--

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,842 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. romungi-MSFT 44,761 Reputation points Microsoft Employee
    2020-11-18T07:30:31.037+00:00

    @Sriram Narayanan As mentioned in the referred documentation link this option is not available with to_pandas_dataframe(), you can only use on_error='null' to replace the failed values with null or fail with exception if 'fail' is used.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.