Serverless Endpoint Creation Doesn't Work via API

Andrei Gaspar
2024-06-19T17:16:24.21+00:00

I am trying to create a Serverless Endpoint as outlined in the documentation here.

No matter which platform I send the request from, it fails with the same error message:

{
  "error": {
    "code": "UserError",
    "message": "Error when parsing request; unable to deserialize request body",
    "details": [],
    "additionalInfo": [
      {
        "type": "ComponentName",
        "info": {
          "value": "managementfrontend"
        }
      },
      {
        "type": "Correlation",
        "info": {
          "value": {
            "operation": "128df3fc69ae065ebc65e39be188c15c",
            "request": "3c447b922622b757"
          }
        }
      },
      {
        "type": "Environment",
        "info": {
          "value": "swedencentral"
        }
      },
      {
        "type": "Location",
        "info": {
          "value": "swedencentral"
        }
      },
      {
        "type": "Time",
        "info": {
          "value": "2024-06-19T17:04:26.5506093+00:00"
        }
      }
    ]
  }
}
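For anyone triaging a response like this, a minimal sketch of how to pull out the fields that matter (the error code, the message, and the correlation IDs that support usually asks for). The function name `summarize_error` is hypothetical, and the sample is trimmed to a few of the entries shown above; the sketch assumes every `additionalInfo` entry has the `type` / `info.value` shape seen in this response.

```python
import json


def summarize_error(raw: str) -> dict:
    """Flatten an error response of the shape shown above into one dict."""
    error = json.loads(raw)["error"]
    summary = {"code": error.get("code"), "message": error.get("message")}
    for item in error.get("additionalInfo", []):
        # Assumption: each entry has a "type" label and an "info.value" payload.
        summary[item["type"]] = item["info"]["value"]
    return summary


# Trimmed copy of the response above, used only to exercise the function.
sample = json.dumps({
    "error": {
        "code": "UserError",
        "message": "Error when parsing request; unable to deserialize request body",
        "details": [],
        "additionalInfo": [
            {"type": "ComponentName", "info": {"value": "managementfrontend"}},
            {"type": "Correlation", "info": {"value": {
                "operation": "128df3fc69ae065ebc65e39be188c15c",
                "request": "3c447b922622b757",
            }}},
            {"type": "Location", "info": {"value": "swedencentral"}},
        ],
    }
})

summary = summarize_error(sample)
print(summary["code"], summary["Correlation"]["operation"])
```

Note that the failing component is reported as `managementfrontend` and the message is about deserialization, which suggests the request is rejected while the body is being parsed, before any endpoint is provisioned.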

I suspect something is wrong with my request body, but I can't tell what: it is based on the API docs, and I have tried every property-value combination I could think of.

{
    "properties": {
        "authMode": "Key",
        "contentSafety": {
            "contentSafetyStatus": "Disabled"
        },
        "modelSettings": {
            "modelId": "azureml://registries/azureml/models/mistralai-Mistral-7B-Instruct-v01/versions/10"
        }
    },
    "location": "swedencentral",
    "kind": "ServerlessEndpoint",
    "identity": {
        "type": "None"
    }
}
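One way to rule out client-side causes is to build the request in code, so that the body is guaranteed to be valid JSON sent with an explicit `Content-Type: application/json` header. The sketch below only constructs the request and does not send it. The URL layout, the `API_VERSION` value, the helper name `build_request`, and all placeholder IDs are assumptions that are not taken from this post and should be checked against the API reference. It is not a confirmed fix for the error above.

```python
import json
import urllib.request

API_VERSION = "2024-04-01"  # assumption: verify against the API reference


def build_request(subscription_id, resource_group, workspace, endpoint_name, token):
    """Build (but do not send) a PUT request carrying the body shown above."""
    # Assumption: standard Azure Resource Manager path for serverless endpoints.
    url = (
        "https://management.azure.com"
        f"/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}"
        "/providers/Microsoft.MachineLearningServices"
        f"/workspaces/{workspace}"
        f"/serverlessEndpoints/{endpoint_name}"
        f"?api-version={API_VERSION}"
    )
    body = {
        "properties": {
            "authMode": "Key",
            "contentSafety": {"contentSafetyStatus": "Disabled"},
            "modelSettings": {
                "modelId": "azureml://registries/azureml/models/"
                           "mistralai-Mistral-7B-Instruct-v01/versions/10"
            },
        },
        "location": "swedencentral",
        "kind": "ServerlessEndpoint",
        "identity": {"type": "None"},
    }
    # json.dumps guarantees syntactically valid JSON; encode it to bytes for urllib.
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        method="PUT",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


# Placeholder values for illustration only.
req = build_request(
    "00000000-0000-0000-0000-000000000000", "my-rg", "my-workspace",
    "my-endpoint", "my-token",
)
print(req.get_method(), req.full_url)
```

If a request built this way still produces the same deserialization error, the likely cause is the content of the body (a property name or value the service does not accept) and not how the client encodes it.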

The identity field provided in the API example doesn't work; the service says the only supported value is None.

Any insights into this would be appreciated!

Azure Machine Learning
Azure