KubernetesOnlineDeployment Class

Definition

Properties specific to a KubernetesOnlineDeployment.

[System.ComponentModel.TypeConverter(typeof(Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.KubernetesOnlineDeploymentTypeConverter))]
public class KubernetesOnlineDeployment : Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.IKubernetesOnlineDeployment, Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Runtime.IValidates
[<System.ComponentModel.TypeConverter(typeof(Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.KubernetesOnlineDeploymentTypeConverter))>]
type KubernetesOnlineDeployment = class
    interface IKubernetesOnlineDeployment
    interface IJsonSerializable
    interface IOnlineDeploymentProperties
    interface IEndpointDeploymentPropertiesBase
    interface IValidates
Public Class KubernetesOnlineDeployment
Implements IKubernetesOnlineDeployment, IValidates
Inheritance
KubernetesOnlineDeployment
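Attributes
TypeConverterAttribute
Implements
IKubernetesOnlineDeployment, IJsonSerializable, IOnlineDeploymentProperties, IEndpointDeploymentPropertiesBase, IValidates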

Constructors

KubernetesOnlineDeployment()

Creates a new KubernetesOnlineDeployment instance.

Properties

AppInsightsEnabled

If true, enables Application Insights logging.

CodeConfiguration

Code configuration for the endpoint deployment.

CodeConfigurationCodeId

ARM resource ID of the code asset.

CodeConfigurationScoringScript

[Required] The script to execute on startup, e.g. "score.py".

ContainerResourceLimitCpu

Number of vCPUs the container is limited to. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/

ContainerResourceLimitGpu

Number of Nvidia GPU cards the container is limited to. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/

ContainerResourceLimitMemory

Memory size the container is limited to. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/

ContainerResourceRequestCpu

Number of vCPUs requested for the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/

ContainerResourceRequestGpu

Number of Nvidia GPU cards requested for the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/

ContainerResourceRequestMemory

Memory size requested for the container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
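
The request and limit properties above follow the Kubernetes resource model described at the linked page, where CPU is commonly written in millicores ("500m") and memory with binary suffixes ("512Mi"). The following is a minimal, language-neutral sketch in Python of how such quantities might be compared before they are assigned. It assumes the values are passed as Kubernetes-style quantity strings, handles only a subset of the suffixes, and uses hypothetical helper names (parse_cpu, parse_memory, request_fits_limit) that are not part of this class.

```python
from decimal import Decimal

# Subset of the binary and decimal suffixes used by Kubernetes memory quantities.
_MEMORY_SUFFIXES = {
    "Ki": 2**10, "Mi": 2**20, "Gi": 2**30, "Ti": 2**40,
    "k": 10**3, "M": 10**6, "G": 10**9, "T": 10**12,
}


def parse_cpu(quantity: str) -> Decimal:
    """Convert a CPU quantity such as "500m" or "2" into a number of vCPUs."""
    if quantity.endswith("m"):
        return Decimal(quantity[:-1]) / 1000
    return Decimal(quantity)


def parse_memory(quantity: str) -> int:
    """Convert a memory quantity such as "512Mi" or "1G" into bytes."""
    # Try two-letter suffixes first so that "Mi" is not mistaken for "M".
    for suffix in sorted(_MEMORY_SUFFIXES, key=len, reverse=True):
        if quantity.endswith(suffix):
            number = Decimal(quantity[: -len(suffix)])
            return int(number * _MEMORY_SUFFIXES[suffix])
    return int(Decimal(quantity))


def request_fits_limit(request: dict, limit: dict) -> bool:
    """Return True when the requested CPU and memory do not exceed the limits."""
    return (
        parse_cpu(request["cpu"]) <= parse_cpu(limit["cpu"])
        and parse_memory(request["memory"]) <= parse_memory(limit["memory"])
    )


# Illustrative values only; they are not defaults of this class.
request = {"cpu": "500m", "memory": "512Mi"}
limit = {"cpu": "1", "memory": "1Gi"}
fits = request_fits_limit(request, limit)
```

A check like this can catch a request that exceeds its limit before the deployment is submitted, which Kubernetes would otherwise reject.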

Description

Description of the endpoint deployment.

EndpointComputeType

[Required] The compute type of the endpoint.

EnvironmentId

ARM resource ID or AssetId of the environment specification for the endpoint deployment.

EnvironmentVariable

Environment variables configuration for the deployment.

InstanceType

Compute instance type.

LivenessProbe

Liveness probe monitors the health of the container regularly.

LivenessProbeFailureThreshold

The number of failures to allow before returning an unhealthy status.

LivenessProbeInitialDelay

The delay before the first probe in ISO 8601 format.

LivenessProbePeriod

The length of time between probes in ISO 8601 format.

LivenessProbeSuccessThreshold

The number of successful probes before returning a healthy status.

LivenessProbeTimeout

The probe timeout in ISO 8601 format.
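
The failure and success thresholds can be read as a small state machine: the status flips to unhealthy after enough failed probes and back to healthy after enough successful ones. The sketch below is a language-neutral illustration in Python, assuming that the thresholds count consecutive results, as Kubernetes probes do. ProbeTracker is a hypothetical name, and the threshold values are illustrative rather than defaults of this class.

```python
from dataclasses import dataclass


@dataclass
class ProbeTracker:
    """Tracks health status from a stream of probe results."""

    failure_threshold: int
    success_threshold: int
    healthy: bool = True
    _failures: int = 0   # consecutive failed probes
    _successes: int = 0  # consecutive successful probes

    def record(self, probe_succeeded: bool) -> bool:
        """Record one probe result and return the resulting health status."""
        if probe_succeeded:
            self._successes += 1
            self._failures = 0
            if not self.healthy and self._successes >= self.success_threshold:
                self.healthy = True
        else:
            self._failures += 1
            self._successes = 0
            if self.healthy and self._failures >= self.failure_threshold:
                self.healthy = False
        return self.healthy


# Illustrative thresholds: three failures mark the container unhealthy,
# one success marks it healthy again.
tracker = ProbeTracker(failure_threshold=3, success_threshold=1)
statuses = [tracker.record(ok) for ok in (True, False, False, False, True)]
```

With these values the status stays healthy through the first two failures, turns unhealthy on the third, and recovers on the next successful probe.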

Model

The URI path to the model.

ModelMountPath

The path to mount the model in a custom container.

Property

Property dictionary. Properties can be added, but not removed or altered.

ProvisioningState

Provisioning state for the endpoint deployment.

ReadinessProbe

Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.

ReadinessProbeFailureThreshold

The number of failures to allow before returning an unhealthy status.

ReadinessProbeInitialDelay

The delay before the first probe in ISO 8601 format.

ReadinessProbePeriod

The length of time between probes in ISO 8601 format.

ReadinessProbeSuccessThreshold

The number of successful probes before returning a healthy status.

ReadinessProbeTimeout

The probe timeout in ISO 8601 format.

RequestSetting

Request settings for the deployment.

RequestSettingMaxConcurrentRequestsPerInstance

The maximum number of concurrent requests per node allowed per deployment. Defaults to 1.

RequestSettingMaxQueueWait

The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms.

RequestSettingRequestTimeout

The scoring timeout in ISO 8601 format. Defaults to 5000ms.
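
The probe and request-setting durations are described as ISO 8601 values, so the stated defaults of 500ms and 5000ms correspond to the strings "PT0.5S" and "PT5S". The sketch below is a language-neutral illustration in Python of converting between such strings and time spans. It supports only the day, hour, minute, and second designators, and parse_duration and format_duration are hypothetical helper names, not members of this class.

```python
import re
from datetime import timedelta

# Supports days, hours, minutes, and (fractional) seconds only.
_DURATION = re.compile(
    r"^P(?:(?P<days>\d+)D)?"
    r"(?:T(?:(?P<hours>\d+)H)?(?:(?P<minutes>\d+)M)?"
    r"(?:(?P<seconds>\d+(?:\.\d+)?)S)?)?$"
)


def parse_duration(text: str) -> timedelta:
    """Parse an ISO 8601 duration such as "PT0.5S" or "PT1M30S"."""
    match = _DURATION.match(text)
    if not match or text in ("P", "PT"):
        raise ValueError(f"not a supported ISO 8601 duration: {text!r}")
    parts = {name: float(value) for name, value in match.groupdict().items() if value}
    return timedelta(**parts)


def format_duration(value: timedelta) -> str:
    """Format a non-negative timedelta as seconds, e.g. "PT0.5S"."""
    total_ms = round(value.total_seconds() * 1000)
    seconds, ms = divmod(total_ms, 1000)
    if ms:
        # Trim trailing zeros from the millisecond fraction.
        text = f"{seconds}.{ms:03d}".rstrip("0")
    else:
        text = str(seconds)
    return f"PT{text}S"


max_queue_wait = format_duration(timedelta(milliseconds=500))    # "PT0.5S"
request_timeout = format_duration(timedelta(milliseconds=5000))  # "PT5S"
```

In .NET code the same round trip is typically done with a TimeSpan value, so a helper like this is mainly useful when building or inspecting the serialized JSON by hand.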

ScaleSetting

Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.

ScaleSettingScaleType

[Required] Type of deployment scaling algorithm.

Methods

DeserializeFromDictionary(IDictionary)

Deserializes an IDictionary into an instance of KubernetesOnlineDeployment.

DeserializeFromPSObject(PSObject)

Deserializes a PSObject into an instance of KubernetesOnlineDeployment.

FromJson(JsonNode)

Deserializes a JsonNode into an instance of Microsoft.Azure.PowerShell.Cmdlets.MachineLearningServices.Models.Api20220501.IKubernetesOnlineDeployment.

FromJsonString(String)

Creates a new instance of KubernetesOnlineDeployment, deserializing the content from a JSON string.

ToJson(JsonObject, SerializationMode)

Serializes this instance of KubernetesOnlineDeployment into a JsonNode.

ToJsonString()

Serializes this instance to a JSON string.

ToString()

Validate(IEventListener)

Validates that this object meets the validation criteria.
