ParisaTabassum-7447 Greetings & Welcome to Microsoft Q&A forum!
Please see below answers to your queries.
how could our partner team deploy and train a gpt-4 model in NCUS region and we can't? Was the region available before and has been decommissioned?
Please note that model availability depends on several factors such as capacity, usage etc.,
GPT-4 models are available in Standard deployment and Provisioned deployment types.
The availability of this model varies with the deployment type in a particular region.
In you case, please double check the deployment type with the partner.
There could also be a possibility that the model was available at the time of deployment by the partner as well.
Can we use TPM quota for gpt-4 instead of PTU quota?
To give more context, Quota is assigned to your subscription on a per-region, per-model basis in units of Tokens-per-Minute (TPM). When you onboard a subscription to Azure OpenAI, you'll receive default quota for most available models. Then, you'll assign TPM to each deployment as it is created, and the available quota for that model will be reduced by that amount. You can continue to create deployments and assign them TPM until you reach your quota limit. Once that happens, you can only create new deployments of that model by reducing the TPM assigned to other deployments of the same model (thus freeing TPM for use), or by requesting and being approved for a model quota increase in the desired region.
Unlike the Tokens Per Minute (TPM) quota used by other Azure OpenAI offerings, PTUs are model-independent. The PTUs might be used to deploy any supported model/version in the region.
See What is provisioned throughput? and Manage Azure OpenAI Service quota for more information.
If we try to transfer the trained model from their subscription to our subscription, will it work?
Yes, fine-tuning supports deploying a fine-tuned model to a different region than where the model was originally fine-tuned. You can also deploy to a different subscription/region.
The only limitations are that the new region must also support fine-tuning and when deploying cross subscription the account generating the authorization token for the deployment must have access to both the source and destination subscriptions.
Cross subscription/region deployment can be accomplished via Python or REST.
Do let me know if that helps or have any other queries.
If the response helped, please do click Accept Answer
and Yes
for was this answer helpful.
Doing so would help other community members with similar issue identify the solution. I highly appreciate your contribution to the community.