Deployment of Prompt flow

Anonymous
2024-02-15T03:45:58.8033333+00:00

Hi, I have developed multiple prompt flows in Azure AI. I was deploying this flow as a real time endpoints. I have two questions to clarify.

  1. Are these real time endpoint in Azure AI serverless?
  2. Is there any option to deploy these prompt flows within a compute instance?
Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
3,333 questions
0 comments No comments
{count} votes

Accepted answer
  1. AshokPeddakotla-MSFT 35,971 Reputation points Moderator
    2024-02-15T08:56:16.5266667+00:00

    Khaitan, Suraj (Ext) Greetings!

    Are these real time endpoint in Azure AI serverless?

    Yes, real-time endpoints in Azure AI are serverless. They are designed to scale automatically based on the incoming traffic and can handle multiple requests simultaneously without the need for you to manage any infrastructure.

    Is there any option to deploy these prompt flows within a compute instance?

    It is possible to deploy your prompt flows within a compute instance. You can use Azure Machine Learning Compute to create a cluster of virtual machines that can be used to run your prompt flows.

    This gives you more control over the environment in which your flows are running and allows you to customize the resources allocated to them.

    To deploy your prompt flows within a compute instance, you can use the Azure Machine Learning SDK.

    Also, See Deploy a flow as a managed online endpoint for real-time inference for more details.

    I hope this helps!

    Do let me know if you have any further questions.

    If the response helped, please do click Accept Answer and Yes for was this answer helpful.

    Doing so would help other community members with similar issue identify the solution. I highly appreciate your contribution to the community.


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.