Configure MLflow for Azure Machine Learning

Azure Machine Learning workspaces are MLflow-compatible, which means they can act as an MLflow server without any extra configuration. Each workspace has an MLflow tracking URI that MLflow can use to connect to the workspace.

However, if you're working outside of Azure Machine Learning (for example, on your local machine, Azure Synapse Analytics, or Azure Databricks), you need to configure MLflow to point to the workspace. In this article, you learn how to configure MLflow to connect to an Azure Machine Learning workspace for tracking, registries, and deployment.

Important

When running on Azure compute (Azure Machine Learning notebooks, Jupyter notebooks hosted on Azure Machine Learning compute instances, or jobs running on Azure Machine Learning compute clusters), you don't have to configure the tracking URI. It's automatically configured for you.

Prerequisites

You need the following prerequisites to follow this tutorial:

  • Install the MLflow SDK package mlflow and the Azure Machine Learning plug-in for MLflow, azureml-mlflow:

    pip install mlflow azureml-mlflow
    

    Tip

    You can use the package mlflow-skinny, which is a lightweight MLflow package without SQL storage, server, UI, or data science dependencies. It's recommended for users who primarily need MLflow's tracking and logging capabilities without importing the full suite of features, including deployments.
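
    For example, a minimal install using the lighter package could look like this (assuming you only need tracking and logging against Azure Machine Learning):

    pip install mlflow-skinny azureml-mlflow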

  • You need an Azure Machine Learning workspace. You can create one following this tutorial.

  • If you're doing remote tracking (tracking experiments running outside Azure Machine Learning), configure MLflow to point to your Azure Machine Learning workspace's tracking URI as explained at Configure MLflow for Azure Machine Learning.

Configure MLflow tracking URI

To connect MLflow to an Azure Machine Learning workspace, you need the tracking URI for the workspace. Each workspace has its own tracking URI, which uses the azureml:// protocol.

  1. Get the tracking URI for your workspace:

    APPLIES TO: Azure CLI ml extension v2 (current)

    1. Sign in and configure your workspace:

      az account set --subscription <subscription>
      az configure --defaults workspace=<workspace> group=<resource-group> location=<location> 
      
    2. You can get the tracking URI using the az ml workspace command:

      az ml workspace show --query mlflow_tracking_uri
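
    Alternatively, if you prefer Python over the Azure CLI, the following sketch retrieves the same value with the Azure Machine Learning SDK v2 (it assumes the azure-ai-ml and azure-identity packages are installed and that you fill in your own workspace details):

      from azure.ai.ml import MLClient
      from azure.identity import DefaultAzureCredential

      # Connect to the workspace and read its MLflow tracking URI
      ml_client = MLClient(
          credential=DefaultAzureCredential(),
          subscription_id="<subscription>",
          resource_group_name="<resource-group>",
          workspace_name="<workspace>",
      )
      mlflow_tracking_uri = ml_client.workspaces.get(ml_client.workspace_name).mlflow_tracking_uri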
      
  2. Configure the tracking URI:

    Use the mlflow.set_tracking_uri() method to point MLflow to the workspace's tracking URI.

    import mlflow

    # Point MLflow to the tracking URI retrieved in the previous step
    mlflow.set_tracking_uri(mlflow_tracking_uri)
    

    Tip

    When working on shared environments, like an Azure Databricks cluster, Azure Synapse Analytics cluster, or similar, it's useful to set the environment variable MLFLOW_TRACKING_URI at the cluster level. That automatically configures the MLflow tracking URI to point to Azure Machine Learning for all sessions running in the cluster, rather than doing it on a per-session basis.
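
    For instance, a minimal sketch of setting that variable in a cluster initialization script or environment configuration (the placeholder is illustrative; use the URI returned by az ml workspace show):

      export MLFLOW_TRACKING_URI="<your-workspace-tracking-uri>"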

Configure authentication

Once the tracking URI is set, you also need to configure how authentication to the associated workspace happens. By default, the Azure Machine Learning plugin for MLflow performs interactive authentication by opening the default browser to prompt for credentials.

The Azure Machine Learning plugin for MLflow supports several authentication mechanisms through the package azure-identity, which is installed as a dependency of the azureml-mlflow plugin. The following authentication methods are tried one by one until one of them succeeds:

  1. Environment: it reads account information specified via environment variables and uses it to authenticate.
  2. Managed Identity: If the application is deployed to an Azure host with Managed Identity enabled, it authenticates with it.
  3. Azure CLI: if a user has signed in via the Azure CLI az login command, it authenticates as that user.
  4. Azure PowerShell: if a user has signed in via Azure PowerShell's Connect-AzAccount command, it authenticates as that user.
  5. Interactive browser: it interactively authenticates a user via the default browser.

For interactive jobs where there's a user connected to the session, you can rely on interactive authentication, and no further action is required.

Warning

Interactive browser authentication blocks code execution when prompting for credentials. It's not a suitable option for authentication in unattended environments like training jobs. We recommend configuring another authentication mode.

For scenarios where unattended execution is required, you have to configure a service principal to communicate with Azure Machine Learning and provide its credentials through environment variables:

import os

# Service principal credentials, read by azure-identity's EnvironmentCredential
os.environ["AZURE_TENANT_ID"] = "<AZURE_TENANT_ID>"
os.environ["AZURE_CLIENT_ID"] = "<AZURE_CLIENT_ID>"
os.environ["AZURE_CLIENT_SECRET"] = "<AZURE_CLIENT_SECRET>"

Tip

When working on shared environments, it's advisable to configure these environment variables at the compute level. As a best practice, manage them as secrets in an instance of Azure Key Vault whenever possible. For instance, in Azure Databricks you can use secrets in environment variables as follows in the cluster configuration: AZURE_CLIENT_SECRET={{secrets/<scope-name>/<secret-name>}}. See Reference a secret in an environment variable for how to do it in Azure Databricks, or refer to similar documentation for your platform.

If you'd rather use a certificate instead of a secret, you can set the environment variable AZURE_CLIENT_CERTIFICATE_PATH to the path of a PEM or PKCS#12 certificate file (including the private key), and AZURE_CLIENT_CERTIFICATE_PASSWORD to the password of the certificate file, if any.
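
For example, a minimal sketch mirroring the previous snippet (the path and password values are placeholders; AZURE_TENANT_ID and AZURE_CLIENT_ID are still required as shown earlier):

import os

# Certificate-based service principal credentials, read by azure-identity
os.environ["AZURE_CLIENT_CERTIFICATE_PATH"] = "<PATH_TO_CERTIFICATE_FILE>"
os.environ["AZURE_CLIENT_CERTIFICATE_PASSWORD"] = "<CERTIFICATE_PASSWORD>"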

Configure authorization and permission levels

Some default roles, like AzureML Data Scientist or Contributor, are already configured to perform MLflow operations in an Azure Machine Learning workspace. If you use a custom role, you need the following permissions:

  • To use MLflow tracking:

    • Microsoft.MachineLearningServices/workspaces/experiments/*.
    • Microsoft.MachineLearningServices/workspaces/jobs/*.
  • To use MLflow model registry:

    • Microsoft.MachineLearningServices/workspaces/models/*/*

Grant access to your workspace for the service principal you created, or for your user account, as explained at Grant access.
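
For example, a possible sketch of assigning the built-in AzureML Data Scientist role to a service principal with the Azure CLI (the client ID and resource IDs are placeholders):

az role assignment create --assignee "<client-id>" \
                          --role "AzureML Data Scientist" \
                          --scope "/subscriptions/<subscription>/resourceGroups/<resource-group>/providers/Microsoft.MachineLearningServices/workspaces/<workspace>"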

Troubleshooting authentication

MLflow tries to authenticate to Azure Machine Learning on the first operation that interacts with the service, like mlflow.set_experiment() or mlflow.start_run(). If you find issues or unexpected authentication prompts during the process, you can increase the logging level to get more details about the error:

import logging

# Show debug-level output from the Azure SDK clients, including azure-identity
logging.getLogger("azure").setLevel(logging.DEBUG)

Set experiment name (optional)

All MLflow runs are logged to the active experiment. By default, runs are logged to an experiment named Default that's automatically created for you. You can configure the experiment where tracking happens.

Tip

When submitting jobs using Azure Machine Learning CLI v2, you can set the experiment name using the property experiment_name in the YAML definition of the job, so you don't have to configure it in your training script. See YAML: display name, experiment name, description, and tags for details.

To configure the experiment you want to work on, use the MLflow command mlflow.set_experiment().

experiment_name = 'experiment_with_mlflow'
mlflow.set_experiment(experiment_name)

Support for non-public Azure clouds

The Azure Machine Learning plugin for MLflow is configured by default to work with the global Azure cloud. However, you can configure the Azure cloud you are using by setting the environment variable AZUREML_CURRENT_CLOUD.

import os

os.environ["AZUREML_CURRENT_CLOUD"] = "AzureChinaCloud"

You can identify the cloud you are using with the following Azure CLI command:

az cloud list

The current cloud has the value IsActive set to True.
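
If you just want to print the name of the active cloud, a possible shortcut (assuming the Azure CLI is installed) is:

az cloud show --query name --output tsv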

Next steps

Now that your environment is connected to your workspace in Azure Machine Learning, you can start to work with it.