TabularDataset "to_pandas_dataframe()" - does not support pandas error_bad_lines?

Sriram Narayanan 21 Reputation points
Nov 17, 2020, 4:42 AM

Hello,

I am trying to skip lines that produce more columns than expected while loading data into a pandas DataFrame.

This is what the pandas option error_bad_lines does: when error_bad_lines=False, pandas skips these lines.

[Attached image: 40293-pictravel.jpg]

How can I achieve this with to_pandas_dataframe()? Thanks.

https://learn.microsoft.com/en-us/python/api/azureml-core/azureml.data.tabulardataset?view=azure-ml-py#to-pandas-dataframe-on-error--null---out-of-range-datetime--null--

Azure Machine Learning

1 answer

  1. romungi-MSFT 48,801 Reputation points Microsoft Employee
    Nov 18, 2020, 7:30 AM

    @Sriram Narayanan As the linked documentation states, to_pandas_dataframe() has no equivalent of this option. Its on_error parameter accepts only two values: 'null', which replaces values that fail with null, and 'fail', which raises an exception.

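Since to_pandas_dataframe() cannot skip such lines, one possible workaround is to clean the delimited file before it is loaded as a dataset. The following is only a minimal sketch using the Python standard library, not an Azure ML feature: the helper name drop_wide_rows and the sample data are hypothetical, and it assumes the first line is a header that defines the expected column count. It drops only rows with more fields than the header, which is the case error_bad_lines=False skips in pandas.

```python
import csv
import io


def drop_wide_rows(src, dst, delimiter=","):
    """Copy CSV rows from src to dst, skipping rows wider than the header.

    Hypothetical helper; returns the number of skipped rows.
    """
    reader = csv.reader(src, delimiter=delimiter)
    writer = csv.writer(dst, delimiter=delimiter, lineterminator="\n")

    header = next(reader, None)
    if header is None:
        return 0  # empty input, nothing to copy
    writer.writerow(header)

    skipped = 0
    for row in reader:
        if len(row) > len(header):
            skipped += 1  # too many fields: drop the row
            continue
        writer.writerow(row)
    return skipped


# Hypothetical sample data: the third data row has one field too many.
raw = "id,city,price\n1,Paris,100\n2,Rome,80\n3,Oslo,120,extra\n4,Lima,60\n"
cleaned = io.StringIO()
skipped = drop_wide_rows(io.StringIO(raw), cleaned)

print(skipped)
print(cleaned.getvalue())
```

In practice the source and destination would be file objects rather than in-memory strings, and the cleaned file would then be uploaded and loaded as a TabularDataset in the usual way.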
