Greetings, Khaitan, Suraj (Ext)!
Are these real-time endpoints in Azure AI serverless?
Yes, real-time endpoints in Azure AI are serverless. They scale automatically with incoming traffic and can handle multiple requests simultaneously, without you having to manage any infrastructure.
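To consume such an endpoint, you send an authenticated JSON POST to its scoring URL. As a minimal sketch, the helper below just assembles the headers and body for that request; the URL, key, and input field name (`question`) are placeholders, since the actual payload shape depends on the inputs your flow defines:

```python
import json

def build_scoring_request(endpoint_url, api_key, inputs):
    """Assemble the headers and body for a real-time endpoint call.
    The URL and key come from the endpoint's Consume tab; values
    used here are placeholders, not real credentials."""
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",
    }
    body = json.dumps(inputs).encode("utf-8")
    return endpoint_url, headers, body

# Example with placeholder values; pass the result to any HTTP client.
url, headers, body = build_scoring_request(
    "https://my-endpoint.eastus.inference.ml.azure.com/score",  # placeholder
    "<api-key>",  # placeholder
    {"question": "What is a serverless endpoint?"},  # flow-specific inputs
)
```

From there you would POST `body` with `headers` to `url` using `urllib.request` or `requests`.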
Is there any option to deploy these prompt flows within a compute instance?
Yes, it is possible to run your prompt flows on your own compute. You can use Azure Machine Learning compute, either a single compute instance or a cluster of virtual machines, to run your prompt flows.
This gives you more control over the environment in which your flows run and allows you to customize the resources allocated to them.
To deploy your prompt flows this way, you can use the Azure Machine Learning SDK or CLI.
Also, see Deploy a flow as a managed online endpoint for real-time inference for more details.
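As a sketch of what the managed online endpoint route looks like with the Azure ML CLI (`az ml online-endpoint create -f endpoint.yml`, then `az ml online-deployment create -f deployment.yml`), the YAML definitions are along these lines. Every name, the model reference, and the instance type below are placeholders you would replace with your own values:

```yaml
# endpoint.yml — placeholder names throughout
$schema: https://azuremlschemas.azureedge.net/latest/managedOnlineEndpoint.schema.json
name: my-pf-endpoint
auth_mode: key

# deployment.yml (a separate file, shown here after the document separator)
---
$schema: https://azuremlschemas.azureedge.net/latest/managedOnlineDeployment.schema.json
name: blue
endpoint_name: my-pf-endpoint
model: azureml:my-registered-flow:1   # your prompt flow, registered as a model
instance_type: Standard_DS3_v2        # example size; pick one for your workload
instance_count: 1
```

The linked documentation covers how to register the flow as a model and which environment settings the deployment needs.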
I hope this helps!
Do let me know if you have any further questions.
If the response helped, please click Accept Answer and Yes for "Was this answer helpful?". Doing so helps other community members with a similar issue identify the solution. I highly appreciate your contribution to the community.