Hello @Stawsh Murawski, thanks for using the Microsoft Q&A platform.
Yes, you can make concurrent calls to the Azure OpenAI endpoint. However, the maximum number of concurrent executions is limited by the capacity of your deployment, which is expressed in Tokens-per-Minute (TPM).
Please note that Azure OpenAI imposes rate limits. These limits define the maximum number of requests and tokens you can send within a given time frame. Exceeding them results in throttling, typically as HTTP 429 (Too Many Requests) errors, so your client should be prepared to wait and retry.
You can refer to the quota limits here: https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits
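To illustrate how concurrent calls can stay within those limits, here is a minimal Python sketch that caps the number of requests in flight and retries with exponential backoff when a call is throttled. Everything named here is an assumption for illustration, not part of any Azure SDK: RateLimitError stands in for whatever exception your client raises on HTTP 429, fake_send is a placeholder for a real request to your deployment, and the concurrency and delay values are arbitrary defaults you would tune against your own TPM quota.

```python
import asyncio
import random


class RateLimitError(Exception):
    """Hypothetical stand-in for the error your client raises on HTTP 429."""


async def call_with_backoff(make_call, max_retries=5, base_delay=0.5):
    """Await make_call(), retrying with exponential backoff on RateLimitError."""
    for attempt in range(max_retries + 1):
        try:
            return await make_call()
        except RateLimitError:
            if attempt == max_retries:
                raise  # out of retries: surface the error to the caller
            # Exponential backoff with jitter so retries do not line up.
            delay = base_delay * (2 ** attempt) * (1 + random.random())
            await asyncio.sleep(delay)


async def run_concurrently(prompts, send, max_concurrency=4, **backoff_kwargs):
    """Send every prompt with at most max_concurrency requests in flight."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def worker(prompt):
        async with semaphore:
            return await call_with_backoff(lambda: send(prompt), **backoff_kwargs)

    # gather returns results in the same order as the input prompts.
    return await asyncio.gather(*(worker(p) for p in prompts))


async def fake_send(prompt):
    """Hypothetical placeholder for a real Azure OpenAI request."""
    await asyncio.sleep(0.01)
    return f"echo: {prompt}"


if __name__ == "__main__":
    print(asyncio.run(run_concurrently(["a", "b", "c"], fake_send)))
```

In a real application you would replace fake_send with a coroutine that calls your Azure OpenAI deployment and raises (or translates to) RateLimitError when the service responds with 429. Lowering max_concurrency is usually the first thing to try if you see frequent throttling.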
Your use case looks similar to a multi-agent system. With the Azure OpenAI Assistants API, you can build multi-agent systems on top of persistent, effectively unbounded conversation threads: messages are appended as users interact, while each Assistant can access files and tools in parallel, including Code Interpreter and function calling.
Here is a sample you can refer to: https://github.com/Azure-Samples/azureai-samples/tree/main/scenarios/Assistants/multi-agent
I hope this helps.
Regards,
Vasavi
Please accept the answer and vote 'Yes' if you found it helpful, to support the community. Thanks.