AutoML automatic data preprocessing?

Hyun Jae Cho 6 Reputation points
2021-01-08T10:46:22.47+00:00

I used a dataset that contains missing values, and Auto ML reached over 90% accuracy. I am curious how Auto ML dealt with missing values and if there is a way to retrieve the preprocessed dataset that Auto ML created? Or does it ignore rows with missing data?

Additionally, I selected "enable deep learning" when creating my Auto ML instance, but when I look at the models tried after the process completes, I do not see deep learning models have been tried. Why is that? I see random forest, LightGBM, XG boost, but no deep neural nets.

Thank you.

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
3,334 questions
0 comments No comments
{count} vote

1 answer

Sort by: Most helpful
  1. Ramr-msft 17,826 Reputation points
    2021-01-08T16:39:30.077+00:00

    @Hyun Jae Cho Thanks for the question. Here is the doc for Auto ML Data prep / feature engineering.

    54932-image.png
    Missing value imputaion: Mean, median, mode, one hot imputation marker.

    Are you using the SDK or UI? We are working on a progress bar which will show up when training DNNs. Until that is released, there are couple of different ways to verify DNNs in the model:

    1. Logs in portal – azureml_automl.log will have printed statement for pretrained transformer or bilstm transformer. You can search for the string “Added” in the logs, which will tell what transformers were added in featurization. When BERT or BiLSTM is chosen, “pretrained” or “bilstm” transformer will appear in this log line
    2. After training the model, download the model and if you have all required dependencies locally available, you can unpickle the downloaded model and look into its featurization steps and it will list pretrained or bilstm transformer when the model pipeline includes it.

    • ML Interpretability dashboard supported to understand feature importance for all models except ForecastTCN.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.