Serverless Endpoints - Create Or Update
Create or update Serverless Endpoint (asynchronous).
PUT https://management.azure.com/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.MachineLearningServices/workspaces/{workspaceName}/serverlessEndpoints/{name}?api-version=2024-04-01
URI Parameters
Name | In | Required | Type | Description |
---|---|---|---|---|
name
|
path | True |
string |
Serverless Endpoint name. Regex pattern: |
resource
|
path | True |
string |
The name of the resource group. The name is case insensitive. |
subscription
|
path | True |
string |
The ID of the target subscription. |
workspace
|
path | True |
string |
Name of Azure Machine Learning workspace. Regex pattern: |
api-version
|
query | True |
string |
The API version to use for this operation. |
Request Body
Name | Required | Type | Description |
---|---|---|---|
location | True |
string |
The geo-location where the resource lives |
properties | True |
[Required] Additional attributes of the entity. |
|
identity |
Managed service identity (system assigned and/or user assigned identities) |
||
kind |
string |
Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. |
|
sku |
Sku details required for ARM contract for Autoscaling. |
||
tags |
object |
Resource tags. |
Responses
Name | Type | Description |
---|---|---|
200 OK |
Create or update request is successful. |
|
201 Created |
Created Headers
|
|
Other Status Codes |
Error |
Examples
CreateOrUpdate Workspace Serverless Endpoint.
Sample request
PUT https://management.azure.com/subscriptions/00000000-1111-2222-3333-444444444444/resourceGroups/test-rg/providers/Microsoft.MachineLearningServices/workspaces/my-aml-workspace/serverlessEndpoints/string?api-version=2024-04-01
{
"properties": {
"modelSettings": {
"modelId": "string"
},
"authMode": "Key",
"contentSafety": {
"contentSafetyStatus": "Enabled"
}
},
"tags": {},
"location": "string",
"kind": "string",
"identity": {
"type": "SystemAssigned",
"userAssignedIdentities": {
"string": {}
}
},
"sku": {
"name": "string",
"tier": "Standard",
"size": "string",
"family": "string",
"capacity": 1
}
}
Sample response
{
"id": "string",
"name": "string",
"type": "string",
"properties": {
"modelSettings": {
"modelId": "string"
},
"authMode": "Key",
"inferenceEndpoint": {
"uri": "https://www.contoso.com/example",
"headers": {
"string": "string"
}
},
"provisioningState": "Updating",
"endpointState": "Deleting",
"marketplaceSubscriptionId": "string",
"contentSafety": {
"contentSafetyStatus": "Disabled"
}
},
"systemData": {
"createdAt": "2020-01-01T12:34:56.999+00:14",
"createdBy": "string",
"createdByType": "Application",
"lastModifiedAt": "2020-01-01T12:34:56.999+00:14",
"lastModifiedBy": "string",
"lastModifiedByType": "Key"
},
"tags": {},
"location": "string",
"kind": "string",
"identity": {
"type": "UserAssigned",
"principalId": "00000000-1111-2222-3333-444444444444",
"tenantId": "00000000-1111-2222-3333-444444444444",
"userAssignedIdentities": {
"string": {
"principalId": "00000000-1111-2222-3333-444444444444",
"clientId": "00000000-1111-2222-3333-444444444444"
}
}
},
"sku": {
"name": "string",
"tier": "Free",
"size": "string",
"family": "string",
"capacity": 1
}
}
{
"id": "string",
"name": "string",
"type": "string",
"properties": {
"modelSettings": {
"modelId": "string"
},
"authMode": "Key",
"inferenceEndpoint": {
"uri": "https://www.contoso.com/example",
"headers": {
"string": "string"
}
},
"provisioningState": "Failed",
"endpointState": "Deleting",
"marketplaceSubscriptionId": "string",
"contentSafety": {
"contentSafetyStatus": "Disabled"
}
},
"systemData": {
"createdAt": "2020-01-01T12:34:56.999+00:14",
"createdBy": "string",
"createdByType": "ManagedIdentity",
"lastModifiedAt": "2020-01-01T12:34:56.999+00:14",
"lastModifiedBy": "string",
"lastModifiedByType": "Key"
},
"tags": {},
"location": "string",
"kind": "string",
"identity": {
"type": "SystemAssigned,UserAssigned",
"principalId": "00000000-1111-2222-3333-444444444444",
"tenantId": "00000000-1111-2222-3333-444444444444",
"userAssignedIdentities": {
"string": {
"principalId": "00000000-1111-2222-3333-444444444444",
"clientId": "00000000-1111-2222-3333-444444444444"
}
}
},
"sku": {
"name": "string",
"tier": "Standard",
"size": "string",
"family": "string",
"capacity": 1
}
}
Definitions
Name | Description |
---|---|
Content |
|
Content |
Specifies the status of content safety. |
created |
The type of identity that created the resource. |
Endpoint |
State of endpoint provisioning. |
Error |
The resource management error additional info. |
Error |
The error detail. |
Error |
Error response |
Managed |
Managed service identity (system assigned and/or user assigned identities) |
Managed |
Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed). |
Model |
|
Serverless |
|
Serverless |
State of the Serverless Endpoint. |
Serverless |
|
Serverless |
|
Serverless |
|
Sku |
The resource model definition representing SKU |
Sku |
This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT. |
system |
Metadata pertaining to creation and last modification of the resource. |
User |
User assigned identity properties |
ContentSafety
Name | Type | Description |
---|---|---|
contentSafetyStatus |
[Required] Specifies the status of content safety. |
ContentSafetyStatus
Specifies the status of content safety.
Name | Type | Description |
---|---|---|
Disabled |
string |
|
Enabled |
string |
createdByType
The type of identity that created the resource.
Name | Type | Description |
---|---|---|
Application |
string |
|
Key |
string |
|
ManagedIdentity |
string |
|
User |
string |
EndpointProvisioningState
State of endpoint provisioning.
Name | Type | Description |
---|---|---|
Canceled |
string |
|
Creating |
string |
|
Deleting |
string |
|
Failed |
string |
|
Succeeded |
string |
|
Updating |
string |
ErrorAdditionalInfo
The resource management error additional info.
Name | Type | Description |
---|---|---|
info |
object |
The additional info. |
type |
string |
The additional info type. |
ErrorDetail
The error detail.
Name | Type | Description |
---|---|---|
additionalInfo |
The error additional info. |
|
code |
string |
The error code. |
details |
The error details. |
|
message |
string |
The error message. |
target |
string |
The error target. |
ErrorResponse
Error response
Name | Type | Description |
---|---|---|
error |
The error object. |
ManagedServiceIdentity
Managed service identity (system assigned and/or user assigned identities)
Name | Type | Description |
---|---|---|
principalId |
string |
The service principal ID of the system assigned identity. This property will only be provided for a system assigned identity. |
tenantId |
string |
The tenant ID of the system assigned identity. This property will only be provided for a system assigned identity. |
type |
Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed). |
|
userAssignedIdentities |
<string,
User |
User-Assigned Identities |
ManagedServiceIdentityType
Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed).
Name | Type | Description |
---|---|---|
None |
string |
|
SystemAssigned |
string |
|
SystemAssigned,UserAssigned |
string |
|
UserAssigned |
string |
ModelSettings
Name | Type | Description |
---|---|---|
modelId |
string |
The unique model identifier that this ServerlessEndpoint should provision. |
ServerlessEndpoint
Name | Type | Description |
---|---|---|
authMode |
[Required] Specifies the authentication mode for the Serverless endpoint. |
|
contentSafety |
Specifies the content safety options. If omitted, the default content safety settings will be configured |
|
endpointState |
The current state of the ServerlessEndpoint. |
|
inferenceEndpoint |
The inference uri to target when making requests against the serverless endpoint |
|
marketplaceSubscriptionId |
string |
The MarketplaceSubscription Azure ID associated to this ServerlessEndpoint. |
modelSettings |
The model settings (model id) for the model being serviced on the ServerlessEndpoint. |
|
provisioningState |
Provisioning state for the endpoint. |
ServerlessEndpointState
State of the Serverless Endpoint.
Name | Type | Description |
---|---|---|
Creating |
string |
|
CreationFailed |
string |
|
Deleting |
string |
|
DeletionFailed |
string |
|
Online |
string |
|
Reinstating |
string |
|
Suspended |
string |
|
Suspending |
string |
|
Unknown |
string |
ServerlessEndpointTrackedResource
Name | Type | Description |
---|---|---|
id |
string |
Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName} |
identity |
Managed service identity (system assigned and/or user assigned identities) |
|
kind |
string |
Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type. |
location |
string |
The geo-location where the resource lives |
name |
string |
The name of the resource |
properties |
[Required] Additional attributes of the entity. |
|
sku |
Sku details required for ARM contract for Autoscaling. |
|
systemData |
Azure Resource Manager metadata containing createdBy and modifiedBy information. |
|
tags |
object |
Resource tags. |
type |
string |
The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts" |
ServerlessInferenceEndpoint
Name | Type | Description |
---|---|---|
headers |
object |
Specifies any required headers to target this serverless endpoint. |
uri |
string |
[Required] The inference uri to target when making requests against the Serverless Endpoint. |
ServerlessInferenceEndpointAuthMode
Name | Type | Description |
---|---|---|
Key |
string |
Sku
The resource model definition representing SKU
Name | Type | Description |
---|---|---|
capacity |
integer |
If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted. |
family |
string |
If the service has different generations of hardware, for the same SKU, then that can be captured here. |
name |
string |
The name of the SKU. Ex - P3. It is typically a letter+number code |
size |
string |
The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code. |
tier |
This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT. |
SkuTier
This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.
Name | Type | Description |
---|---|---|
Basic |
string |
|
Free |
string |
|
Premium |
string |
|
Standard |
string |
systemData
Metadata pertaining to creation and last modification of the resource.
Name | Type | Description |
---|---|---|
createdAt |
string |
The timestamp of resource creation (UTC). |
createdBy |
string |
The identity that created the resource. |
createdByType |
The type of identity that created the resource. |
|
lastModifiedAt |
string |
The timestamp of resource last modification (UTC) |
lastModifiedBy |
string |
The identity that last modified the resource. |
lastModifiedByType |
The type of identity that last modified the resource. |
UserAssignedIdentity
User assigned identity properties
Name | Type | Description |
---|---|---|
clientId |
string |
The client ID of the assigned identity. |
principalId |
string |
The principal ID of the assigned identity. |