Deployment of custom model takes a very long time

Question

Sophia Li 20

Hello everyone,

I am trying to deploy one of my fine-tuned models in Azure OpenAI Studio. The deployment has already taken 7 hours and is still ongoing, even though the model page says the model is deployable. I also tried deploying some base models, and those deployments finished immediately.

How can I solve this issue? Thank you in advance!

AshokPeddakotla-MSFT 35,971 Reputation points Moderator

2024-07-15T03:58:52.3733333+00:00

Sophia Li Greetings & Welcome to Microsoft Q&A forum!

Could you also share the region where deployment is failing and the model details?

I'm checking on this issue internally and will update you with my findings as soon as possible.

Appreciate your time and patience.
Sophia Li 20 Reputation points

2024-07-15T04:02:06.7133333+00:00

@AshokPeddakotla-MSFT

Hi, it’s the North Central US region. Thank you.
AshokPeddakotla-MSFT 35,971 Reputation points Moderator

2024-07-15T04:15:41.49+00:00

Sophia Li Thanks for the information. Could you confirm whether the deployment works in any other region?

What is the model?
Sophia Li 20 Reputation points

2024-07-15T04:17:46.02+00:00

@AshokPeddakotla-MSFT The base model is GPT-4. I haven’t tried other regions because of limited quota.
AshokPeddakotla-MSFT 35,971 Reputation points Moderator

2024-07-15T07:40:36.3+00:00

Ok, I will check on this and update you.
Torsten 15 Reputation points

2024-07-15T13:01:32.7566667+00:00

Please let me know as well! I am experiencing a similar issue. Region: Sweden Central. Base model: gpt-4 (0613).
Sophia Li 20 Reputation points

2024-07-16T14:27:36.0066667+00:00

@AshokPeddakotla-MSFT, Hi, the deployment completed!

Accepted answer

Answer 1

Sophia Li Thanks for the confirmation. I have checked internally and can confirm that this issue is resolved. To give more context: the job was allocated only one node resource, so jobs that require multiple node resources for fine-tuned model training waited for resources indefinitely.

Our team has rolled back this configuration, and the issue is resolved.

If the response helped, please click Accept Answer and select Yes for "Was this answer helpful".

Doing so helps other community members with a similar issue find the solution. I highly appreciate your contribution to the community.
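For readers who run into a similar stall, it may help to put a time limit on the wait so that a stuck deployment is noticed and reported early rather than after many hours. The following is only a minimal sketch: `wait_for_deployment`, the `get_status` callable, and the state names in `TERMINAL_STATES` are hypothetical placeholders, not part of any Azure SDK. You would need to supply a `get_status` function that returns the provisioning state from whatever client or REST call you actually use.

```python
import time
from typing import Callable

# Assumed state names for illustration only; check what your service returns.
TERMINAL_STATES = {"succeeded", "failed", "canceled"}


def wait_for_deployment(
    get_status: Callable[[], str],
    timeout_s: float = 3600.0,
    poll_interval_s: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> str:
    """Poll get_status() until a terminal state is seen or timeout_s elapses.

    get_status is a caller-supplied (hypothetical) function that returns the
    current deployment state as a string.
    """
    deadline = clock() + timeout_s
    while True:
        status = get_status().lower()
        if status in TERMINAL_STATES:
            return status
        if clock() >= deadline:
            # Surface the stall instead of waiting indefinitely.
            raise TimeoutError(
                f"deployment still '{status}' after {timeout_s} seconds"
            )
        sleep(poll_interval_s)


if __name__ == "__main__":
    # Simulated status sequence standing in for a real status call.
    fake_states = iter(["Creating", "Creating", "Succeeded"])
    result = wait_for_deployment(lambda: next(fake_states), sleep=lambda s: None)
    print(result)
```

If the timeout is reached, the raised `TimeoutError` is a reasonable point to collect the region and base model details and open a support request, as was done in this thread.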
