Export or delete your Machine Learning service workspace data

In Azure Machine Learning, you can export or delete your workspace data with either the portal graphical interface or the Python SDK. This article describes both options.

Note

For information about viewing or deleting personal data, see Azure Data Subject Requests for the GDPR. For more information about GDPR, see the GDPR section of the Microsoft Trust Center and the GDPR section of the Service Trust portal.

Note

This article provides steps about how to delete personal data from the device or service and can be used to support your obligations under the GDPR. For general information about GDPR, see the GDPR section of the Microsoft Trust Center and the GDPR section of the Service Trust portal.

Control your workspace data

The in-product data that Azure Machine Learning stores is available for export and deletion. You can export and delete data with Azure Machine Learning studio, the CLI, and the SDK. Additionally, you can access telemetry data through the Azure Privacy portal.

In Azure Machine Learning, personal data consists of user information in job history documents.

An Azure workspace relies on a resource group to hold the related resources for an Azure solution. When you create a workspace, you have the opportunity to use an existing resource group, or to create a new one. See this page to learn more about Azure resource groups.

Delete high-level resources using the portal

When you create a workspace, Azure creates several resources within the resource group:

  • The workspace itself
  • A storage account
  • A container registry
  • An Applications Insights instance
  • A key vault

To delete these resources, select them from the list, and choose Delete:

Important

If the resource is configured for soft delete, the data won't actually delete unless you optionally select to delete the resource permanently. For more information, see the following articles:

Screenshot of portal, with delete icon highlighted.

A confirmation dialog box opens, where you can confirm your choices.

Job history documents might contain personal user information. These documents are stored in the storage account in blob storage, in /azureml subfolders. You can download and delete the data from the portal.

Screenshot of the Azure Machine Learning directory in the storage account, within the portal.

Export and delete machine learning resources using Azure Machine Learning studio

Azure Machine Learning studio provides a unified view of your machine learning resources - for example, notebooks, data assets, models, and jobs. Azure Machine Learning studio emphasizes preservation of a record of your data and experiments. You can delete computational resources - pipelines and compute resources - right in the browser. For these resources, navigate to the resource in question, and choose Delete.

You can unregister data assets and archive jobs, but these operations don't delete the data. To entirely remove the data, data assets and job data require deletion at the storage level. Storage level deletion happens in the portal, as described earlier. Azure Machine Learning studio can handle individual deletion. Job deletion deletes the data of that job.

Azure Machine Learning studio can handle training artifact downloads from experimental jobs. Choose the relevant Job. Choose Output + logs, and navigate to the specific artifacts you wish to download. Choose ... and Download, or select Download all.

To download a registered model, navigate to the Model and choose Download.

Screenshot of studio model page with download option highlighted.

Export and delete resources using the Python SDK

You can download the outputs of a particular job using:

# Retrieved from Azure Machine Learning web UI
run_id = 'aaaaaaaa-bbbb-cccc-dddd-0123456789AB'
experiment = ws.experiments['my-experiment']
run = next(run for run in ex.get_runs() if run.id == run_id)
metrics_output_port = run.get_pipeline_output('metrics_output')
model_output_port = run.get_pipeline_output('model_output')

metrics_output_port.download('.', show_progress=True)
model_output_port.download('.', show_progress=True)

The following machine learning resources can be deleted using the Python SDK:

Type Function Call Notes
Workspace delete Use delete-dependent-resources to cascade the delete
Model delete
ComputeTarget delete
WebService delete

Next steps

Learn more about Managing a workspace.