CognitiveServicesManagedComputeDeploymentProperties Class

Definition

Properties of a Cognitive Services managed compute deployment.

public class CognitiveServicesManagedComputeDeploymentProperties : System.ClientModel.Primitives.IJsonModel<Azure.ResourceManager.CognitiveServices.Models.CognitiveServicesManagedComputeDeploymentProperties>, System.ClientModel.Primitives.IPersistableModel<Azure.ResourceManager.CognitiveServices.Models.CognitiveServicesManagedComputeDeploymentProperties>
type CognitiveServicesManagedComputeDeploymentProperties = class
    interface IJsonModel<CognitiveServicesManagedComputeDeploymentProperties>
    interface IPersistableModel<CognitiveServicesManagedComputeDeploymentProperties>
Public Class CognitiveServicesManagedComputeDeploymentProperties
Implements IJsonModel(Of CognitiveServicesManagedComputeDeploymentProperties), IPersistableModel(Of CognitiveServicesManagedComputeDeploymentProperties)
Inheritance
CognitiveServicesManagedComputeDeploymentProperties
Implements

Constructors

Name Description
CognitiveServicesManagedComputeDeploymentProperties(Uri)

Initializes a new instance of CognitiveServicesManagedComputeDeploymentProperties.

Properties

Name Description
AcceleratorsPerInstance

Read-only. Number of accelerators (GPUs) consumed by each model instance, sourced from the deployment template.

AcceleratorType

Accelerator type (e.g., H100_80GB). Optional on creation; immutable after creation.

ComputeId

Foundry compute ARM resource ID for VM-backed managed compute deployments. Required when sku.name is VmManagedCompute; immutable after creation.

DeploymentTemplate

Deployment template identifier. Optional on creation. Accepts an AzureML Registry deployment template URI or a project-scoped deployment template path for VmManagedCompute. Examples: azureml://registries/{registry}/deploymenttemplates/{template}/versions/{version}, projects/{project}/deploymentTemplates/{template}/versions/{version}

Model

AzureML Registry model asset URI. Required on creation; immutable after creation. Example: azureml://registries/{registry}/models/{model}/versions/{version}

Priority

Scheduling priority for VM-backed managed compute deployments. Immutable after creation.

ProvisioningDetails

Read-only. Status message and timestamp from the last provisioning operation.

ProvisioningState

Read-only. Current provisioning state.

Routes

Read-only. Inference route paths relative to the account endpoint. Populated when provisioningState is Succeeded.

TotalAccelerators

Read-only. Total accelerators allocated: sku.capacity (instances) x acceleratorsPerInstance.

VersionUpgradeOption

Template auto-upgrade policy. Defaults to OnceNewDefaultVersionAvailable.

Methods

Name Description
JsonModelCreateCore(Utf8JsonReader, ModelReaderWriterOptions)
JsonModelWriteCore(Utf8JsonWriter, ModelReaderWriterOptions)
PersistableModelCreateCore(BinaryData, ModelReaderWriterOptions)
PersistableModelWriteCore(ModelReaderWriterOptions)

Explicit Interface Implementations

Name Description
IJsonModel<CognitiveServicesManagedComputeDeploymentProperties>.Create(Utf8JsonReader, ModelReaderWriterOptions)
IJsonModel<CognitiveServicesManagedComputeDeploymentProperties>.Write(Utf8JsonWriter, ModelReaderWriterOptions)
IPersistableModel<CognitiveServicesManagedComputeDeploymentProperties>.Create(BinaryData, ModelReaderWriterOptions)
IPersistableModel<CognitiveServicesManagedComputeDeploymentProperties>.GetFormatFromOptions(ModelReaderWriterOptions)
IPersistableModel<CognitiveServicesManagedComputeDeploymentProperties>.Write(ModelReaderWriterOptions)

Applies to