Export or delete your Machine Learning service workspace data
In Azure Machine Learning, you can export or delete your workspace data with either the portal graphical interface or the Python SDK. This article describes both options.
For information about viewing or deleting personal data, see Azure Data Subject Requests for the GDPR. For more information about GDPR, see the GDPR section of the Microsoft Trust Center and the GDPR section of the Service Trust portal.
This article provides steps about how to delete personal data from the device or service and can be used to support your obligations under the GDPR. For general information about GDPR, see the GDPR section of the Microsoft Trust Center and the GDPR section of the Service Trust portal.
Control your workspace data
The in-product data that Azure Machine Learning stores is available for export and deletion. You can export and delete data with Azure Machine Learning studio, the CLI, and the SDK. Additionally, you can access telemetry data through the Azure Privacy portal.
In Azure Machine Learning, personal data consists of user information in job history documents.
An Azure workspace relies on a resource group to hold the related resources for an Azure solution. When you create a workspace, you have the opportunity to use an existing resource group, or to create a new one. See this page to learn more about Azure resource groups.
Delete high-level resources using the portal
When you create a workspace, Azure creates several resources within the resource group:
- The workspace itself
- A storage account
- A container registry
- An Applications Insights instance
- A key vault
To delete these resources, select them from the list, and choose Delete:
If the resource is configured for soft delete, the data won't actually delete unless you optionally select to delete the resource permanently. For more information, see the following articles:
A confirmation dialog box opens, where you can confirm your choices.
Job history documents might contain personal user information. These documents are stored in the storage account in blob storage, in
/azureml subfolders. You can download and delete the data from the portal.
Export and delete machine learning resources using Azure Machine Learning studio
Azure Machine Learning studio provides a unified view of your machine learning resources - for example, notebooks, data assets, models, and jobs. Azure Machine Learning studio emphasizes preservation of a record of your data and experiments. You can delete computational resources - pipelines and compute resources - right in the browser. For these resources, navigate to the resource in question, and choose Delete.
You can unregister data assets and archive jobs, but these operations don't delete the data. To entirely remove the data, data assets and job data require deletion at the storage level. Storage level deletion happens in the portal, as described earlier. Azure Machine Learning studio can handle individual deletion. Job deletion deletes the data of that job.
Azure Machine Learning studio can handle training artifact downloads from experimental jobs. Choose the relevant Job. Choose Output + logs, and navigate to the specific artifacts you wish to download. Choose ... and Download, or select Download all.
To download a registered model, navigate to the Model and choose Download.
Export and delete resources using the Python SDK
You can download the outputs of a particular job using:
# Retrieved from Azure Machine Learning web UI run_id = 'aaaaaaaa-bbbb-cccc-dddd-0123456789AB' experiment = ws.experiments['my-experiment'] run = next(run for run in ex.get_runs() if run.id == run_id) metrics_output_port = run.get_pipeline_output('metrics_output') model_output_port = run.get_pipeline_output('model_output') metrics_output_port.download('.', show_progress=True) model_output_port.download('.', show_progress=True)
The following machine learning resources can be deleted using the Python SDK:
Learn more about Managing a workspace.