CognitiveServicesManagedComputeDeploymentProperties Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Properties of a Cognitive Services managed compute deployment.
public class CognitiveServicesManagedComputeDeploymentProperties : System.ClientModel.Primitives.IJsonModel<Azure.ResourceManager.CognitiveServices.Models.CognitiveServicesManagedComputeDeploymentProperties>, System.ClientModel.Primitives.IPersistableModel<Azure.ResourceManager.CognitiveServices.Models.CognitiveServicesManagedComputeDeploymentProperties>
type CognitiveServicesManagedComputeDeploymentProperties = class
interface IJsonModel<CognitiveServicesManagedComputeDeploymentProperties>
interface IPersistableModel<CognitiveServicesManagedComputeDeploymentProperties>
Public Class CognitiveServicesManagedComputeDeploymentProperties
Implements IJsonModel(Of CognitiveServicesManagedComputeDeploymentProperties), IPersistableModel(Of CognitiveServicesManagedComputeDeploymentProperties)
- Inheritance
-
CognitiveServicesManagedComputeDeploymentProperties
- Implements
Constructors
| Name | Description |
|---|---|
| CognitiveServicesManagedComputeDeploymentProperties(Uri) |
Initializes a new instance of CognitiveServicesManagedComputeDeploymentProperties. |
Properties
| Name | Description |
|---|---|
| AcceleratorsPerInstance |
Read-only. Number of accelerators (GPUs) consumed by each model instance, sourced from the deployment template. |
| AcceleratorType |
Accelerator type (e.g., H100_80GB). Optional on creation; immutable after creation. |
| ComputeId |
Foundry compute ARM resource ID for VM-backed managed compute deployments. Required when sku.name is VmManagedCompute; immutable after creation. |
| DeploymentTemplate |
Deployment template identifier. Optional on creation. Accepts an AzureML Registry deployment template URI or a project-scoped deployment template path for VmManagedCompute. Examples: azureml://registries/{registry}/deploymenttemplates/{template}/versions/{version}, projects/{project}/deploymentTemplates/{template}/versions/{version} |
| Model |
AzureML Registry model asset URI. Required on creation; immutable after creation. Example: azureml://registries/{registry}/models/{model}/versions/{version} |
| Priority |
Scheduling priority for VM-backed managed compute deployments. Immutable after creation. |
| ProvisioningDetails |
Read-only. Status message and timestamp from the last provisioning operation. |
| ProvisioningState |
Read-only. Current provisioning state. |
| Routes |
Read-only. Inference route paths relative to the account endpoint. Populated when provisioningState is Succeeded. |
| TotalAccelerators |
Read-only. Total accelerators allocated: sku.capacity (instances) x acceleratorsPerInstance. |
| VersionUpgradeOption |
Template auto-upgrade policy. Defaults to OnceNewDefaultVersionAvailable. |