LLMOps with prompt flow and GitHub

Large language model operations, or LLMOps, has become the cornerstone of efficient prompt engineering and LLM-infused application development and deployment. As the demand for LLM-infused applications continues to soar, organizations need a cohesive and streamlined process to manage their end-to-end lifecycle.

Azure Machine Learning allows you to integrate with GitHub to automate the LLM-infused application development lifecycle with prompt flow.

Azure Machine Learning prompt flow provides a streamlined and structured approach to developing LLM-infused applications. Its well-defined process and lifecycle guide you through building, testing, optimizing, and deploying flows, culminating in fully functional LLM-infused solutions.

LLMOps Prompt Flow Features

LLMOps with prompt flow is an "LLMOps template and guidance" that helps you build LLM-infused apps using prompt flow. It provides the following features:

  • Centralized Code Hosting: The repo hosts code for multiple prompt flow based flows, providing a single repository for all your flows. Think of it as a library for your flows, making it easy to find, access, and collaborate on different projects.

  • Lifecycle Management: Each flow enjoys its own lifecycle, allowing for smooth transitions from local experimentation to production deployment.

  • Variant and Hyperparameter Experimentation: Experiment with multiple variants and hyperparameters, evaluating flow variants with ease. Variants and hyperparameters are like ingredients in a recipe: the platform lets you experiment with different combinations of them across multiple nodes in a flow.

  • Multiple Deployment Targets: The repo supports configuration-driven deployment of flows to Azure App Service, Kubernetes, and Azure managed compute, ensuring that your flows can scale as needed. It also generates Docker images that bundle the flow runtime and your flows, ready for deployment to any target platform and operating system that supports Docker.

  • A/B Deployment: Seamlessly implement A/B deployments, enabling you to compare different flow versions. Just as in traditional A/B testing for websites, the platform facilitates A/B deployment for prompt flow, so you can compare different versions of a flow in a real-world setting to determine which performs best.

  • Many-to-Many Dataset/Flow Relationships: The platform accommodates multiple datasets for each standard and evaluation flow, ensuring versatility in flow testing and evaluation.

  • Conditional Data and Model Registration: The platform registers a new version of a dataset in Azure Machine Learning data assets, or of a flow in the model registry, only when that asset has actually changed (see the sketch after this list).

  • Comprehensive Reporting: Generate detailed reports for each variant configuration, allowing you to make informed decisions. Detailed metrics are collected for all bulk runs and experiments and emitted as both CSV and HTML files, enabling data-driven decisions.
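
For illustration, here's a minimal sketch of how such conditional registration can work: compare a hash of the local dataset against the hash recorded at the last registration, and register a new data asset version only on a mismatch. The file paths, asset name, and hash file are hypothetical; the template's own implementation may differ.

# Minimal sketch of conditional data registration (paths and names are
# placeholders; assumes az CLI defaults for workspace and resource group).
DATA_FILE="data/eval_dataset.jsonl"
HASH_FILE=".last_registered_hash"

NEW_HASH=$(md5sum "$DATA_FILE" | cut -d' ' -f1)
OLD_HASH=$(cat "$HASH_FILE" 2>/dev/null || echo "none")

if [ "$NEW_HASH" != "$OLD_HASH" ]; then
  # Register a new data asset version only when the content changed.
  az ml data create --name eval_dataset --path "$DATA_FILE" --type uri_file
  echo "$NEW_HASH" > "$HASH_FILE"
else
  echo "Dataset unchanged; skipping registration."
fi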

Other features for customization:

  • Offers BYOF (bring your own flows): a complete platform for developing multiple use cases for LLM-infused applications.

  • Offers configuration-based development, with no need to write extensive boilerplate code.

  • Supports execution of both prompt experimentation and evaluation locally as well as in the cloud.

  • Provides notebooks for local evaluation of prompts, along with a library of functions for local experimentation.

  • Tests endpoints within the pipeline after deployment to check their availability and readiness (see the readiness-check sketch after this list).

  • Provides an optional human-in-the-loop step to validate prompt metrics before deployment.
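
As a hedged illustration of such a post-deployment check, the snippet below polls a deployed endpoint's scoring URL until it responds successfully. The URL, key, and request payload are placeholders you'd replace with values from your own deployment and flow.

# Sketch of a post-deployment readiness check (URL, key, and payload are
# placeholders specific to your deployment and your flow's inputs).
ENDPOINT_URL="https://<your-endpoint>.<region>.inference.ml.azure.com/score"
ENDPOINT_KEY="<your-endpoint-key>"

for i in $(seq 1 10); do
  STATUS=$(curl -s -o /dev/null -w "%{http_code}" \
    -H "Authorization: Bearer $ENDPOINT_KEY" \
    -H "Content-Type: application/json" \
    -d '{"question": "ping"}' "$ENDPOINT_URL")
  if [ "$STATUS" = "200" ]; then
    echo "Endpoint ready."
    break
  fi
  echo "Attempt $i returned HTTP $STATUS; retrying..."
  sleep 15
done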

LLMOps with prompt flow provides capabilities for both simple and complex LLM-infused apps, and is completely customizable to the needs of the application.

LLMOps Stages

The lifecycle comprises four distinct stages:

  • Initialization: Clearly define the business objective, gather relevant data samples, establish a basic prompt structure, and craft a flow that enhances its capabilities.

  • Experimentation: Apply the flow to sample data, assess the prompt's performance, and refine the flow as needed. Continuously iterate until satisfied with the results.

  • Evaluation & Refinement: Benchmark the flow's performance using a larger dataset, evaluate the prompt's effectiveness, and make refinements accordingly. Progress to the next stage if the results meet the desired standards.

  • Deployment: Optimize the flow for efficiency and effectiveness, deploy it in a production environment including A/B deployment, monitor its performance, gather user feedback, and use this information to further enhance the flow.

By adhering to this structured methodology, Prompt Flow empowers you to confidently develop, rigorously test, fine-tune, and deploy flows, leading to the creation of robust and sophisticated AI applications.

The LLMOps prompt flow template formalizes this structured methodology using a code-first approach and helps you build LLM-infused apps with prompt flow, using tools and processes relevant to prompt flow. It offers a range of features, including centralized code hosting, lifecycle management, variant and hyperparameter experimentation, A/B deployment, reporting for all runs and experiments, and more.

The repository for this article is available at LLMOps with prompt flow template (https://github.com/microsoft/llmops-promptflow-template).

LLMOps process flow

Screenshot of the LLMOps prompt flow process.

  1. This is the initialization stage. Here, flows are developed, data is prepared and curated, and LLMOps-related configuration files are updated.
  2. After local development using Visual Studio Code with the Prompt Flow extension, a pull request is raised from the feature branch to the development branch. This executes the build validation pipeline and also runs the experimentation flows.
  3. The PR is manually approved, and the code is merged to the development branch.
  4. After the PR is merged to the development branch, the CI pipeline for the dev environment is executed. It runs both the experimentation and evaluation flows in sequence and, among other steps, registers the flows in the Azure Machine Learning Registry.
  5. After the CI pipeline completes, a CD trigger executes the CD pipeline, which deploys the standard flow from the Azure Machine Learning Registry as an Azure Machine Learning online endpoint and runs integration and smoke tests on the deployed flow.
  6. A release branch is created from the development branch, or a pull request is raised from the development branch to the release branch.
  7. The PR is manually approved, and the code is merged to the release branch. After the PR is merged to the release branch, the CI pipeline for the prod environment is executed. It runs both the experimentation and evaluation flows in sequence and, among other steps, registers the flows in the Azure Machine Learning Registry.
  8. After the CI pipeline completes, a CD trigger executes the CD pipeline, which deploys the standard flow from the Azure Machine Learning Registry as an Azure Machine Learning online endpoint and runs integration and smoke tests on the deployed flow (a sketch of such a smoke test follows this list).
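
For example, a smoke test in the CD pipeline might invoke the deployed online endpoint with a sample request through the Azure CLI. The endpoint name, deployment name, and request file below are illustrative placeholders.

# Illustrative smoke test against the deployed online endpoint (names and
# the sample request file are placeholders; assumes az CLI defaults for
# workspace and resource group).
az ml online-endpoint invoke \
  --name my-flow-endpoint \
  --deployment-name blue \
  --request-file sample_request.json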

From here on, you can learn LLMOps with prompt flow by following the end-to-end samples we provide, which help you build LLM-infused applications using prompt flow and GitHub. The template's primary objective is to assist in the development of such applications, leveraging the capabilities of prompt flow and LLMOps.

Tip

We recommend you understand how to integrate LLMOps with prompt flow.

Important

Prompt flow is currently in public preview. This preview is provided without a service-level agreement and isn't recommended for production workloads. Certain features might not be supported or might have constrained capabilities. For more information, see Supplemental Terms of Use for Microsoft Azure Previews.

Prerequisites

  • An Azure subscription. If you don't have an Azure subscription, create a free account before you begin. Try the free or paid version of Azure Machine Learning.
  • An Azure Machine Learning workspace.
  • Git running on your local machine.
  • GitHub as the source control repository.

Note

Git version 2.27 or newer is required. For more information on installing the Git command, see https://git-scm.com/downloads and select your operating system.

Important

The CLI commands in this article were tested using Bash. If you use a different shell, you may encounter errors.

Set up Prompt Flow

Prompt flow uses a connections resource to connect to endpoints such as Azure OpenAI, OpenAI, or Azure AI Search, and uses a runtime to execute flows. Create these resources before running any flows in prompt flow.

Set up connections for prompt flow

Connections can be created through the prompt flow portal UI or by using the REST API. Follow the guidelines to create connections for prompt flow.


Note

The sample flows use a connection named 'aoai'; create a connection with this name to run them.
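
For local development, the promptflow CLI offers an equivalent way to create this connection from a YAML definition, as sketched below. The API key, resource name, and API version are placeholders to replace with your own values.

# Sketch: create the 'aoai' connection locally with the pf CLI
# (api_key, api_base, and api_version values are placeholders).
cat > aoai_connection.yaml <<'EOF'
name: aoai
type: azure_open_ai
api_key: "<your-api-key>"
api_base: "https://<your-resource>.openai.azure.com"
api_type: azure
api_version: "2023-03-15-preview"
EOF
pf connection create --file aoai_connection.yaml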

Set up compute and runtime for prompt flow

A runtime can be created through the prompt flow portal UI or by using the REST API. Follow the guidelines to set up compute and a runtime for prompt flow.


Note

The same runtime name should be used in the LLMOps_config.json file explained later.
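
As a quick sanity check, you can confirm that the runtime name recorded in the config matches the runtime you created. The RUNTIME_NAME key below is hypothetical; consult the template's documentation for the actual schema of LLMOps_config.json.

# Hypothetical check: print the runtime name from LLMOps_config.json
# (the RUNTIME_NAME key is illustrative; the real key may differ).
jq -r '.RUNTIME_NAME' LLMOps_config.json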

Set up GitHub Repository

Several steps are needed to set up the LLMOps process using a GitHub repository.

Fork and configure the repo

Follow the guidelines to create a forked repo in your GitHub organization. This repo uses two branches, main and development, for code promotion and pipeline execution in response to code changes.

Set up authentication between GitHub and Azure

Follow the guidelines to use the previously created service principal and set up authentication between the GitHub repository and Azure services.

This step configures a GitHub secret that stores the service principal information. The workflows in the repository can read the connection information using the secret name, which lets GitHub workflow steps connect to Azure automatically. A hedged sketch of this setup follows.
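
If you're working from the command line, the sketch below shows one way to create the service principal (if you haven't already) and store its JSON credentials as a GitHub secret using the Azure and GitHub CLIs. The service principal name, scope values, and the secret name AZURE_CREDENTIALS are illustrative; use the names your workflows actually expect.

# Illustrative setup: create a service principal and store its JSON
# credentials as a GitHub secret (names and scopes are placeholders).
az ad sp create-for-rbac --name llmops-sp --role contributor \
  --scopes /subscriptions/<subscription-id>/resourceGroups/<resource-group> \
  --sdk-auth > sp_credentials.json
gh secret set AZURE_CREDENTIALS < sp_credentials.json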

Cloning the repo

Follow the guidelines to create a new local repository.

This lets you create a new feature branch from the development branch and incorporate your changes, as shown in the example below.
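
For example (the fork URL and branch name are placeholders for your own):

# Clone your fork (placeholder URL) and branch off development.
git clone https://github.com/<your-org>/llmops-promptflow-template.git
cd llmops-promptflow-template
git checkout development
git checkout -b feature/my-flow-change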

Test the pipelines

Follow the guidelines to test the pipelines. The steps are:

  1. Raise a PR (pull request) from a feature branch to the development branch (see the example after this list).
  2. The PR pipeline should execute automatically as a result of the branch policy configuration.
  3. The PR is then merged to the development branch.
  4. The associated 'dev' pipeline is executed. This results in full CI and CD execution and in the provisioning or updating of Azure Machine Learning endpoints.
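
The PR in step 1 can be raised from the GitHub UI or, as sketched here, with the GitHub CLI. The branch names, title, and body are placeholders.

# Raise a PR from the feature branch to development with the GitHub CLI
# (branch names, title, and body are placeholders).
git push -u origin feature/my-flow-change
gh pr create --base development --head feature/my-flow-change \
  --title "Update my flow" --body "Testing the PR pipeline"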

The test outputs should be similar to the ones shown here.

Local execution

To harness the capabilities of the local execution, follow these installation steps:

  1. Clone the repository: Begin by cloning the template's repository from GitHub.
git clone https://github.com/microsoft/llmops-promptflow-template.git
  2. Set up env file: Create a .env file at the top folder level and provide information for the items mentioned. Add as many connection names as needed. All the flow examples in this repo use an AzureOpenAI connection named aoai. Add a line aoai={"api_key": "","api_base": "","api_type": "azure","api_version": "2023-03-15-preview"} with updated values for api_key and api_base. If additional connections with different names are used in your flows, add them accordingly. Currently, only flows with AzureOpenAI as the provider are supported.

experiment_name=
connection_name_1={ "api_key": "","api_base": "","api_type": "azure","api_version": "2023-03-15-preview"}
connection_name_2={ "api_key": "","api_base": "","api_type": "azure","api_version": "2023-03-15-preview"}
  3. Prepare the local conda or virtual environment to install the dependencies.

python -m pip install promptflow promptflow-tools promptflow-sdk jinja2 promptflow[azure] openai promptflow-sdk[builtins] python-dotenv

  4. Bring or write your flows into the template based on the documentation here.

  5. Write Python scripts similar to the provided examples in the local_execution folder.

Next steps