Evenementer
Mar 17, 9 PM - Mar 21, 10 AM
Maacht mat bei der Meetup-Serie, fir skaléierbar KI-Léisungen op Basis vu realistesche Benotzungsfäll mat aneren Entwéckler an Experten ze bauen.
Elo umellenDëse Browser gëtt net méi ënnerstëtzt.
Upgrat op Microsoft Edge fir vun de Virdeeler vun leschten Eegeschaften, Sécherheetsupdaten, an techneschem Support ze profitéieren.
In early December, 2024, Microsoft launched several changes to the provisioned offering. These changes include:
This article is intended for existing users of the provisioned throughput offering. New customers should refer to the Azure OpenAI provisioned onboarding guide.
The changes below apply to the global provisioned, data zone provisioned, and provisioned deployment types.
Wichteg
The changes in this article do not apply to the older "Provisioned Classic (PTU-C)" offering. They only affect the Provisioned (also known as the Provisioned Managed) offering.
Data zone provisioned deployments are available in the same Azure OpenAI resource as all other Azure OpenAI deployment types but allow you to leverage Azure's global infrastructure to dynamically route traffic to the data center within the Microsoft defined data zone with the best availability for each request. Data zone provisioned deployments provide reserved model processing capacity for high and predictable throughput using Azure global infrastructure within the Microsoft defined data zone. Data zone deployments are supported for gpt-4o and gpt-4o-mini model families.
For more information, see the deployment types guide.
In August 2024, Microsoft announced that Provisioned deployments would move to a new hourly payment model with the option to purchase Azure Reservations to support additional discounts. In December's provisioned update, we will be introducing differentiated hourly pricing across global provisioned, data zone provisioned, and provisioned deployment types. For more information on the hourly price for each provisioned deployment type, see the Pricing details page.
In addition to the updates for the hourly payment model, new Azure Reservations will be introduced specifically for global and data zone provisioned deployment types. With these new Azure Reservations, every provisioned deployment type will have a separate Azure Reservation that can be purchased to support additional discounts. The mapping between each provisioned deployment type and the associated Azure Reservation are as follows:
Provisioned deployment type | Sku name in code | Azure Reservation product name |
---|---|---|
Global provisioned | GlobalProvisionedManaged |
Provisioned Managed Global |
Data zone provisioned | DataZoneProvisionedManaged |
Provisioned Managed Data Zone |
Provisioned | ProvisionedManaged |
Provisioned Managed Regional |
Wichteg
Azure Reservations for Azure OpenAI provisioned offers are not interchangeable across deployment types. The Azure Reservation purchased must match the provisioned deployment type. If the Azure Reservation purchased does not match the provisioned deployment type, the provisioned deployment will default to the hourly payment model until a matching Azure Reservation product is purchased. For more information, see the Azure Reservations for Azure OpenAI Service provisioned guidance.
Existing customers of provisioned deployments can choose to migrate to global or data zone provisioned deployments to benefit from the lower deployment minimums, granular scale increments, or differentiated pricing available for these deployment types. To learn more about how global and data zone provisioned deployments handle data processing across Azure geographies, see the Azure OpenAI deployment data processing documentation.
Two approaches are available for customers to migrate from provisioned deployments to global or data zone provisioned deployments.
The zero downtime migration approach allows customers to migrate their existing provisioned deployments to global or data zone provisioned deployments without interrupting the existing inference traffic on their deployment. This migration approach minimizes workload interruptions, but does require a customer to have multiple coexisting deployments while shifting traffic over. The process to migrate a provisioned deployment using the zero downtime migration approach is as follows:
The migration with downtime approach involves migrating existing provisioned deployments to global or data zone provisioned deployments while stopping any existing inference traffic on the original provisioned deployment. This migration approach does not require coexistence of multiple deployments to support but does require workload interruption to complete. The process to migrate a provisioned deployment using the migration with downtime approach is as follows:
Azure Reservations for Azure OpenAI Service provisioned offers are specific to the provisioned deployment type. If the Azure Reservation purchased does not match the provisioned deployment type, the deployment will default to the hourly payment model. If you choose to migrate to global or data zone provisioned deployments, you might need to purchase a new Azure Reservation for these deployments to support additional discounts. For more information on how to purchase a new Azure Reservation or make changes to an existing Azure Reservation, see the Azure Reservations for Azure OpenAI Service Provisioned guidance.
Evenementer
Mar 17, 9 PM - Mar 21, 10 AM
Maacht mat bei der Meetup-Serie, fir skaléierbar KI-Léisungen op Basis vu realistesche Benotzungsfäll mat aneren Entwéckler an Experten ze bauen.
Elo umellenTraining
Modul
Optimize spend and performance with Azure OpenAI Service provisioned reservations - Training
This module introduces you to provisioned deployments in Azure OpenAI services.
Zertifizéierung
Microsoft Certified: Azure AI Fundamentals - Certifications
Demonstrate fundamental AI concepts related to the development of software and services of Microsoft Azure to create AI solutions.
Dokumentatioun
Azure OpenAI Provisioned August 2024 Update - Azure OpenAI
Learn about the improvements to Provisioned Throughput
Azure OpenAI Service Provisioned Throughput Units (PTU) onboarding - Azure AI services
Learn about provisioned throughput units onboarding and Azure OpenAI.
Walkthrough on how to get started provisioned deployments on Azure OpenAI Service.