Hi Jason
sorry for delay
To increase the token rate limit for your Azure OpenAI Service, you need to request a quota increase. Here are the steps you can follow.
- Your maximum quota values may be lower if your Azure subscription is linked to certain offer types. For example, if you're on a free trial or a student subscription, your limit might be 1,000 tokens per minute.
- You can submit a quota increase request via the quota increase request formNote that due to high demand, requests are filled in the order they are received, and priority is given to customers who are actively consuming their existing quota.
- If you have the ability to modify your deployment settings, you can adjust the Tokens-Per-Minute (TPM) allocation. This can be done in the Azure AI Foundry portal under the Deployments section.
- If you are unable to modify the rate limit due to your current subscription type, you may need to upgrade your subscription or change your offer type to access higher limits.
Kindly Refer these document https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/quota#understanding-rate-limits
I hope these helps you. Thank you!