Does the Azure OpenAI dynamic quota for deployments feature allows a deployment to use TPMs quotas from other deployments for the same mode/region?
Johannes Borch
0
Reputation points
I have a setup in Azure OpenAI where I have two deployments using the same model/region/subscription sharing the model TPM quota equally (50%). If I enable the new dynamic quota feature on my deployments, will one model be able to consume the full model quota at 100% when the other deployment does not consume tokens? Or is the dynamic quota feature just a way to increase the regional model quota temporarily if Azure region has capacity?
Sign in to answer