Azure OpenAI Service model deprecations and retirements
Overview
Azure OpenAI Service models are continually refreshed with newer and more capable models. As part of this process, we deprecate and retire older models. This document provides information about the models that are currently available, deprecated, and retired.
Terminology
- Retirement
  - When a model is retired, it's no longer available for use. Azure OpenAI Service deployments of a retired model always return error responses.
- Deprecation
  - When a model is deprecated, it's no longer available for new customers. It continues to be available for use by customers with existing deployments until the model is retired.
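The two terms above can be read as a small lifecycle check. The following is a minimal sketch, not an official API: the names `Status`, `model_status`, and `can_use` are hypothetical, and it assumes a deprecation or retirement takes effect on the stated date itself.

```python
from datetime import date
from enum import Enum
from typing import Optional


class Status(Enum):
    AVAILABLE = "available"
    DEPRECATED = "deprecated"
    RETIRED = "retired"


def model_status(today: date,
                 deprecation: Optional[date],
                 retirement: Optional[date]) -> Status:
    """Classify a model version by date (assumption: dates are inclusive)."""
    if retirement is not None and today >= retirement:
        return Status.RETIRED
    if deprecation is not None and today >= deprecation:
        return Status.DEPRECATED
    return Status.AVAILABLE


def can_use(status: Status, has_existing_deployment: bool) -> bool:
    """Retired: nobody. Deprecated: existing deployments only. Otherwise: anyone."""
    if status is Status.RETIRED:
        return False
    if status is Status.DEPRECATED:
        return has_existing_deployment
    return True


# Example with the gpt-4 0613 dates listed later in this document.
status = model_status(date(2024, 12, 1), date(2024, 10, 1), date(2025, 6, 6))
print(status.value, can_use(status, has_existing_deployment=True))
```

Passing `None` for the deprecation date covers models that only have a retirement date.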
Notifications
Azure OpenAI notifies customers with active Azure OpenAI Service deployments of models that have upcoming retirements. For each deployment, we notify customers as follows:
- At model launch, we programmatically designate a "not sooner than" retirement date (typically six months to one year out).
- At least 60 days' notice before model retirement for Generally Available (GA) models.
- At least 14 days' notice before preview model version upgrades.
Retirements are done on a rolling basis, region by region.
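To make the notice periods concrete, here is a minimal sketch that computes the latest date by which a notice would be due under the minimums above. The function name `latest_notice_date` and the constants are hypothetical and used only for illustration.

```python
from datetime import date, timedelta

GA_NOTICE_DAYS = 60       # minimum notice before a GA model retirement
PREVIEW_NOTICE_DAYS = 14  # minimum notice before a preview version upgrade


def latest_notice_date(event_date: date, is_preview: bool) -> date:
    """Return the last day on which notice still meets the minimum period."""
    days = PREVIEW_NOTICE_DAYS if is_preview else GA_NOTICE_DAYS
    return event_date - timedelta(days=days)


# A GA retirement on October 1, 2024 needs notice by August 2, 2024.
print(latest_notice_date(date(2024, 10, 1), is_preview=False))
# A preview upgrade starting August 15, 2024 needs notice by August 1, 2024.
print(latest_notice_date(date(2024, 8, 15), is_preview=True))
```

Because retirements roll out region by region, the actual date for a given deployment may be later than the earliest published date.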
Model availability
- At least one year of model availability for GA models after the release date of a model in at least one region worldwide.
- For global deployments, all future model versions starting with gpt-4o and gpt-4 0409 will be available with their (N) next succeeding model (N+1) for comparison together.
- Customers have 60 days to try out a new GA model in at least one global, or standard region, before any upgrades happen to a newer GA model.
Considerations for the Azure public cloud
Be aware of the following:
- Not all model version combinations will be available in all regions.
- Model version N and N+1 might not always be available in the same region.
- GA model version N might upgrade to a future model version N+X in some regions based on capacity limitations, and without the new model version N+X separately being available to test in the same region. The new model version will be available to test in other regions before any upgrades are scheduled.
- Preview model versions and GA versions of the same model won't always be available to test together in the same region. There will be preview and GA versions available to test in different regions.
- We reserve the right to limit future customers using a particular region to balance service quality for existing customers.
- As always at Microsoft, security is of the utmost importance. If a model or model version is found to have compliance or security issues, we reserve the right to retire it on an emergency basis. See the terms of service for more information.
Special considerations for Azure Government clouds
- Global standard deployments won't be available in government clouds.
- Not all models or model versions available in commercial / public cloud will be available in government clouds.
- In the Azure Government clouds, we intend to support only one version of a given model at a time.
  - For example, only one version of gpt-35-turbo 0125 and gpt-4o (2024-05-13).
- There will however be a 30 day overlap between new model versions, where more than two will be available.
  - For example, if gpt-35-turbo 0125 or gpt-4o (2024-05-13) is updated to a future version, or
  - for model family changes beyond version updates, such as when moving from gpt-4 1106-preview to gpt-4o (2024-05-13).
Who is notified of upcoming retirements
For each subscription that has a deployment of a model with an upcoming retirement, Azure OpenAI notifies members of the following roles:
- Owner
- Contributor
- Reader
- Monitoring contributor
- Monitoring reader
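When auditing who will receive retirement notices, the role list above can be turned into a simple membership check. This is only a sketch: `NOTIFIED_ROLES` and `is_notified` are hypothetical names, the role assignments are assumed to be plain strings you have already collected, and the comparison ignores case and surrounding whitespace.

```python
# Roles listed in this section, normalized for case-insensitive comparison.
NOTIFIED_ROLES = frozenset(
    role.casefold()
    for role in (
        "Owner",
        "Contributor",
        "Reader",
        "Monitoring contributor",
        "Monitoring reader",
    )
)


def is_notified(assigned_roles) -> bool:
    """Return True if any assigned role is one of the notified roles."""
    return any(role.strip().casefold() in NOTIFIED_ROLES for role in assigned_roles)


print(is_notified(["Monitoring Reader"]))        # role is in the list
print(is_notified(["Storage Blob Data Reader"]))  # role is not in the list
```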
How to get ready for model retirements and version upgrades
To prepare for model retirements and version upgrades, we recommend that customers test their applications against the new models and versions and evaluate their behavior. We also recommend that customers update their applications to use the new models and versions before the retirement date.
For more information, see How to upgrade to a new model or version.
Current models
Note
Not all models go through a deprecation period prior to retirement. Some models/versions only have a retirement date.
These models are currently available for use in Azure OpenAI Service.
Model | Version | Retirement date |
---|---|---|
gpt-35-turbo | 0301 | No earlier than October 1, 2024 |
gpt-35-turbo, gpt-35-turbo-16k | 0613 | October 1, 2024 |
gpt-35-turbo | 1106 | No earlier than Nov 17, 2024 |
gpt-35-turbo | 0125 | No earlier than Feb 22, 2025 |
gpt-4, gpt-4-32k | 0314 | Deprecation: October 1, 2024; Retirement: June 6, 2025 |
gpt-4, gpt-4-32k | 0613 | Deprecation: October 1, 2024; Retirement: June 6, 2025 |
gpt-4 | 1106-preview | To be upgraded to gpt-4 version turbo-2024-04-09, starting on August 15, 2024, or later 1 |
gpt-4 | 0125-preview | To be upgraded to gpt-4 version turbo-2024-04-09, starting on August 15, 2024, or later 1 |
gpt-4 | vision-preview | To be upgraded to gpt-4 version turbo-2024-04-09, starting on August 15, 2024, or later 1 |
gpt-35-turbo-instruct | 0914 | No earlier than Sep 14, 2025 |
text-embedding-ada-002 | 2 | No earlier than April 3, 2025 |
text-embedding-ada-002 | 1 | No earlier than April 3, 2025 |
text-embedding-3-small | | No earlier than Feb 2, 2025 |
text-embedding-3-large | | No earlier than Feb 2, 2025 |
1 We will notify all customers with these preview deployments at least two weeks before the start of the upgrades. We will publish an upgrade schedule detailing the order of regions and model versions that we will follow during the upgrades, and link to that schedule from here.
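One way to use the table above is to flag deployments whose earliest retirement date is approaching. The sketch below is illustrative only: the dates are transcribed by hand from a few rows of the table, the deployment names are made up, and `retiring_within` is a hypothetical helper rather than part of any Azure SDK. Always check the published table for current dates.

```python
from datetime import date

# (model, version) -> earliest retirement date, copied from the table above.
EARLIEST_RETIREMENT = {
    ("gpt-35-turbo", "0301"): date(2024, 10, 1),
    ("gpt-35-turbo", "0613"): date(2024, 10, 1),
    ("gpt-35-turbo", "1106"): date(2024, 11, 17),
    ("gpt-35-turbo", "0125"): date(2025, 2, 22),
    ("gpt-4", "0613"): date(2025, 6, 6),
}


def retiring_within(deployments, today: date, horizon_days: int = 90):
    """Return (deployment_name, retirement_date) pairs due within the horizon.

    `deployments` is an iterable of (name, model, version) tuples.
    Unknown model versions are skipped rather than guessed.
    """
    flagged = []
    for name, model, version in deployments:
        retire = EARLIEST_RETIREMENT.get((model, version))
        if retire is not None and (retire - today).days <= horizon_days:
            flagged.append((name, retire))
    return sorted(flagged, key=lambda item: item[1])


# Hypothetical deployments, checked as of August 1, 2024.
my_deployments = [
    ("chat-prod", "gpt-35-turbo", "0613"),
    ("chat-new", "gpt-35-turbo", "0125"),
]
for name, retire in retiring_within(my_deployments, today=date(2024, 8, 1)):
    print(f"{name}: plan migration before {retire.isoformat()}")
```

Rows marked "No earlier than" give a lower bound, so the helper errs on the side of flagging early.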
Deprecated models
These models were deprecated on July 6, 2023 and retired on June 14, 2024. They are no longer available for new deployments. Deployments created before July 6, 2023 remained available to customers until June 14, 2024. We recommend that customers migrate their applications to deployments of the suggested replacement models.
If you're an existing customer looking for information about these models, see Legacy models.
Model | Deprecation date | Retirement date | Suggested replacement |
---|---|---|---|
ada | July 6, 2023 | June 14, 2024 | babbage-002 |
babbage | July 6, 2023 | June 14, 2024 | babbage-002 |
curie | July 6, 2023 | June 14, 2024 | davinci-002 |
davinci | July 6, 2023 | June 14, 2024 | davinci-002 |
text-ada-001 | July 6, 2023 | June 14, 2024 | gpt-35-turbo-instruct |
text-babbage-001 | July 6, 2023 | June 14, 2024 | gpt-35-turbo-instruct |
text-curie-001 | July 6, 2023 | June 14, 2024 | gpt-35-turbo-instruct |
text-davinci-002 | July 6, 2023 | June 14, 2024 | gpt-35-turbo-instruct |
text-davinci-003 | July 6, 2023 | June 14, 2024 | gpt-35-turbo-instruct |
code-cushman-001 | July 6, 2023 | June 14, 2024 | gpt-35-turbo-instruct |
code-davinci-002 | July 6, 2023 | June 14, 2024 | gpt-35-turbo-instruct |
text-similarity-ada-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-similarity-babbage-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-similarity-curie-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-similarity-davinci-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-ada-doc-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-ada-query-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-babbage-doc-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-babbage-query-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-curie-doc-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-curie-query-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-davinci-doc-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
text-search-davinci-query-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
code-search-ada-code-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
code-search-ada-text-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
code-search-babbage-code-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
code-search-babbage-text-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
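For migration scripts, the table above can be encoded as a lookup. This is a minimal sketch: `REPLACEMENTS` and `suggested_replacement` are hypothetical names, and the mapping is built only from the rows listed in the table.

```python
from typing import Optional

SIZES = ("ada", "babbage", "curie", "davinci")

# Base models.
REPLACEMENTS = {
    "ada": "babbage-002",
    "babbage": "babbage-002",
    "curie": "davinci-002",
    "davinci": "davinci-002",
}

# Completion and code models.
for name in (
    "text-ada-001", "text-babbage-001", "text-curie-001",
    "text-davinci-002", "text-davinci-003",
    "code-cushman-001", "code-davinci-002",
):
    REPLACEMENTS[name] = "gpt-35-turbo-instruct"

# Embedding models: similarity, text search, and code search families.
embedding_models = [f"text-similarity-{s}-001" for s in SIZES]
embedding_models += [f"text-search-{s}-{kind}-001" for s in SIZES for kind in ("doc", "query")]
embedding_models += [f"code-search-{s}-{kind}-001" for s in ("ada", "babbage") for kind in ("code", "text")]
for name in embedding_models:
    REPLACEMENTS[name] = "text-embedding-3-small"


def suggested_replacement(model: str) -> Optional[str]:
    """Return the suggested replacement, or None if the model isn't in the table."""
    return REPLACEMENTS.get(model)


print(suggested_replacement("text-davinci-003"))
print(suggested_replacement("text-search-curie-query-001"))
```

A replacement model may behave differently from the model it replaces, so evaluate outputs before switching production traffic.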
Retirement and deprecation history
July 18, 2024
- Updated gpt-4 0613 deprecation date to October 1, 2024 and the retirement date to June 6, 2025.
June 19, 2024
- Updated gpt-35-turbo 0301 retirement date to no earlier than October 1, 2024.
- Updated gpt-35-turbo & gpt-35-turbo-16k 0613 retirement date to October 1, 2024.
- Updated gpt-4 & gpt-4-32k 0314 deprecation date to October 1, 2024, and retirement date to June 6, 2025.
June 4, 2024
Retirement date for legacy models updated by one month.
April 24, 2024
Earliest retirement date for gpt-35-turbo 0301 and 0613 has been updated to August 1, 2024.
March 13, 2024
We published this document to provide information about the current models, deprecated models, and upcoming retirements.
February 23, 2024
We announced the upcoming in-place upgrade of gpt-4 version 1106-preview to 0125-preview to start no earlier than March 8, 2024.
November 30, 2023
The default version of gpt-4 and gpt-4-32k was updated from 0314 to 0613 starting on November 30, 2023. The upgrade of 0314 deployments set for autoupgrade to 0613 was completed on December 3, 2023.
July 6, 2023
We announced the deprecation of models with upcoming retirement on July 5, 2024.