Hello Jamie Voynow,
To request higher rate limits on your GPT-4o Azure AI Foundry deployment, you can submit a quota increase request and request a quota increase in the Azure AI Foundry.
Companies must be registered businesses and pass domain verification to be eligible for quota limit increases. If your usage is consuming the existing quota allocation due to high demand, your request for an increase may be prioritized.
As a Microsoft for Startups member, you have access to priority support. Sign in to the Microsoft for Startups portal and submit a request via the Guidance tab. A Startup Advisor will arrange a call to discuss your needs and assist you in the process.
Technical Guidance & Support Overview.
Additionally, you can enable the autoscale feature for your Azure AI services, which automatically adjusts rate limits based on real-time usage and capacity metrics to optimize performance.
Please refer this Autoscale Azure AI limits.
I Hope this helps. Do let me know if you have any further queries.
If this answers your query, please do click Accept Answer
and Yes
for was this answer helpful.
Thank you!