Llama 3.1 405B Instruct as a serverless API not working with stream response

wong2 5 Reputation points
2024-07-24T06:06:26.6333333+00:00

I have deployed Llama 3.1 models with Azure AI Studio by following this document: https://learn.microsoft.com/en-us/azure/ai-studio/how-to/deploy-models-llama?tabs=llama-three

When calling it with the API, it works if stream is set to false:User's image

But if I set stream to true, the response is empty:User's image

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,895 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.