OpenAI Service Assistant Playground Rate Limit Error
I'm brand new to Azure, and trying to assess Azure for a prototype/demo of an app I am working on using Azure OpenAI Services, leveraging the assistants feature. As such, I made a new account to take advantage of the $200 credits I am given.
I set up a subscription, a resource group, and then added the OpenAI Service to the resource group.
When I went to make an assistant, I needed to make a deployment. However, I kept getting a warning about insufficient quota. I was still able to deploy a model in regular chat, and I could chat with it. However, I could not make an assistant through most deployment types. For some deployment types I was actually able to make an assistant and not have that quota error, however when trying to chat with it I kept getting a "rate_limit_exceeded: Rate limit is exceeded. Try again in 86400 seconds." error, when all I sent was a single message with the text "Hello".
I eventually learned that I may need to upgrade my account. I upgraded my account to the "Standard" tier.
Since doing that, the quotas warning is no longer there when making a deployment when making an assistant. I can even set the TPM now for the assistant. Clearly progress was made.
I now make a deployment for my assistant and send a single "Hello" message. I am once again met with the "rate_limit_exceeded: Rate limit is exceeded. Try again in 86400 seconds." error. I tried increasing the TPM from 2k to 11k and still get the same exact error.
The regions are all the same "East US". My quotas in the OpenAI Service are there and look fine as far as I can tell. In my subscription, when I go to Usage + Quotas, it's empty. There is also a warning at the top that says "The selected provider is not registered for some of the selected subscriptions. To access your quotas, register the resource provider.". As far as I can tell, Microsoft.CognitiveServices is already registered. I also already assigned my account the "Cognitive Services Usages Reader" permission.
How do I solve this?