Hello Chad Solmon,Thanks for your question.
This is possibly due to.:
- Misconfigured token usage in your .NET application.
- An overrun of the token limit due to application logic or concurrent requests.
- The resource not being correctly set up in terms of quotas or permissions.
Could you possibly configure use retry logic to gracefully handle rate-limit errors and test?
Also Go to the Azure Portal > Select your OpenAI resource > Usage + quotas > Request quota increase.
You can mark it 'Accept Answer' and 'Upvote' if this helped you
Regards,
Abiola