Thanks for the question. You can make calls to two different AOAI endpoints directly from your own app by checking tokens in response.
Here is the blog for load balancing the Azure Open AI services using APIM.
https://journeyofthegeek.com/2023/05/31/load-balancing-in-azure-openai-service/