Deploy a custom speech model

2025-05-25

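In this article, you learn how to deploy an endpoint for a custom speech model. Except for batch transcription, you must deploy a custom endpoint to use a custom speech model.

Tip

A hosted deployment endpoint isn't required to use custom speech with the Batch transcription API. If the custom speech model is used only for batch transcription, you can conserve resources. For more information, see "Speech service pricing".

You can deploy an endpoint for a base or custom model, and then update the endpoint later to use a better trained model.

Note

Endpoints used by F0 Speech resources are deleted after seven days.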

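Add a deployment endpoint

Sign in to the Azure AI Foundry portal.
Select [Fine-tuning] from the left pane, and then select [AI Service fine-tuning].
Select the custom speech fine-tuning task (by model name) that you started as described in the article on how to start custom speech fine-tuning.
Select [Deploy models] > [+ Deploy model].
In the [Deploy a new model] wizard, select the model that you want to deploy.
Enter a name and description for the deployment. Select the box to agree to the terms of use. Then select [Deploy].
After the deployment status is [Succeeded], you can view the deployment details. Select the deployment to view details such as the endpoint ID.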

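To create a custom endpoint, follow these steps:

Sign in to Speech Studio.
Select [Custom speech] > your project name > [Deploy models].

If this is your first endpoint, the table lists no endpoints. After you create an endpoint, use this page to track each deployed endpoint.
Select [Deploy model] to start the new endpoint wizard.
On the [New endpoint] page, enter a name and description for your custom endpoint.
Select the custom model that you want to associate with the endpoint.
Optionally, select the checkbox to enable audio and diagnostic logging of the endpoint's traffic.
Select [Add] to save and deploy the endpoint.

On the main [Deploy models] page, details about the new endpoint (such as name, description, status, and expiration date) are displayed in a table. It can take up to 30 minutes to instantiate a new endpoint that uses your custom models. When the status of the deployment changes to [Succeeded], the endpoint is ready to use.

Important

Take note of the model expiration date. This is the last date that you can use your custom model for speech recognition. For more information, see "Model and endpoint lifecycle".

Select the endpoint link to view information specific to that endpoint, such as the endpoint key, endpoint URL, and sample code.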

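To create an endpoint and deploy a model, use the spx csr endpoint create command. Construct the request parameters according to the following instructions:

Set the project property to the ID of an existing project. This property is recommended so that you can also view and manage the endpoint in the Azure AI Foundry portal. You can run the spx csr project list command to get the available projects.
Set the required model property to the ID of the model that you want deployed to the endpoint.
Set the required language property. The endpoint locale must match the locale of the model. The locale can't be changed later. The Speech CLI language property corresponds to the locale property in the JSON request and response.
Set the required name property. This is the name that is displayed in the Azure AI Foundry portal. The Speech CLI name property corresponds to the displayName property in the JSON request and response.
Optionally, you can set the logging property. Set this to enabled to turn on audio and diagnostic logging of the endpoint's traffic. The default is false.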
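
The correspondence between the Speech CLI parameters and the JSON property names can be sketched in a few lines of Python. This is only an illustration: the helper name cli_args_to_request_body is hypothetical, the base URL pattern is copied from the sample responses in this article, and the model URI assumes a custom model (not a base model).

```python
import json

# URL pattern taken from the sample responses in this article (assumption:
# your resource uses the same host pattern and API version).
API_BASE = "https://{region}.api.cognitive.microsoft.com/speechtotext/v3.2"


def cli_args_to_request_body(region, model, language, name,
                             project=None, description="", logging=False):
    """Hypothetical helper: map Speech CLI style values to JSON properties."""
    base = API_BASE.format(region=region)
    body = {
        # The CLI takes a model ID; the JSON body references the model by URI.
        "model": {"self": f"{base}/models/{model}"},
        "locale": language,      # CLI "language" -> JSON "locale"
        "displayName": name,     # CLI "name" -> JSON "displayName"
        "description": description,
        "properties": {"loggingEnabled": bool(logging)},  # CLI "logging"
    }
    if project:
        # The project is optional but recommended.
        body["project"] = {"self": f"{base}/projects/{project}"}
    return body


body = cli_args_to_request_body(
    region="eastus",
    model="9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd",
    language="en-US",
    name="My Endpoint",
    project="0198f569-cc11-4099-a0e8-9d55bc3d0c52",
    description="My Endpoint Description",
    logging=True,
)
print(json.dumps(body, indent=2))
```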

Here's an example Speech CLI command to create an endpoint and deploy a model:

spx csr endpoint create --api-version v3.2 --project YourProjectId --model YourModelId --name "My Endpoint" --description "My Endpoint Description" --language "en-US"

You should receive a response body in the following format:

{
  "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
  "model": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd"
  },
  "links": {
    "logs": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37/files/logs",
    "restInteractive": "https://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restConversation": "https://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restDictation": "https://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketInteractive": "wss://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketConversation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketDictation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37"
  },
  "project": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/projects/0198f569-cc11-4099-a0e8-9d55bc3d0c52"
  },
  "properties": {
    "loggingEnabled": true
  },
  "lastActionDateTime": "2024-07-15T16:29:36Z",
  "status": "NotStarted",
  "createdDateTime": "2024-07-15T16:29:36Z",
  "locale": "en-US",
  "displayName": "My Endpoint",
  "description": "My Endpoint Description"
}

The top-level self property in the response body is the endpoint's URI. Use this URI to get details about the endpoint's project, model, and logs. You also use this URI to update the endpoint.
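
As a rough sketch of how a client might read such a response, the following Python snippet pulls the endpoint ID out of the self URI and checks that the recognition URL carries the same ID in its cid query parameter. The sample is an abbreviated copy of the response above, and the function name endpoint_id_from_self is hypothetical.

```python
import json
from urllib.parse import urlparse, parse_qs

# Abbreviated copy of the sample response body shown in this article.
sample_response = json.loads("""
{
  "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
  "links": {
    "logs": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37/files/logs",
    "restInteractive": "https://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37"
  },
  "status": "NotStarted",
  "locale": "en-US"
}
""")


def endpoint_id_from_self(self_uri):
    """Return the last path segment of the endpoint URI (the endpoint ID)."""
    return urlparse(self_uri).path.rstrip("/").split("/")[-1]


endpoint_id = endpoint_id_from_self(sample_response["self"])

# The recognition URLs identify the custom endpoint through the "cid" parameter.
rest_url = sample_response["links"]["restInteractive"]
cid = parse_qs(urlparse(rest_url).query)["cid"][0]

print("Endpoint ID:", endpoint_id)
print("cid matches endpoint ID:", cid == endpoint_id)
print("Status:", sample_response["status"])
```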

For Speech CLI help with endpoints, run the following command:

spx help csr endpoint

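To create an endpoint and deploy a model, use the Endpoints_Create operation of the Speech to text REST API. Construct the request body according to the following instructions:

Set the project property to the URI of an existing project. This property is recommended so that you can also view and manage the endpoint in the Azure AI Foundry portal. You can make a Projects_List request to get the available projects.
Set the required model property to the URI of the model that you want deployed to the endpoint.
Set the required locale property. The endpoint locale must match the locale of the model. The locale can't be changed later.
Set the required displayName property. This is the name that is displayed in the Azure AI Foundry portal.
Optionally, you can set the loggingEnabled property within properties. Set this to true to turn on audio and diagnostic logging of the endpoint's traffic. The default is false.

Make an HTTP POST request using the URI as shown in the following Endpoints_Create example. Replace YourSpeechResourceKey with your Speech resource key, replace YourServiceRegion with your Speech resource region, and set the request body properties as previously described.

curl -v -X POST -H "Ocp-Apim-Subscription-Key: YourSpeechResourceKey" -H "Content-Type: application/json" -d '{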
  "project": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/projects/0198f569-cc11-4099-a0e8-9d55bc3d0c52"
  },
  "properties": {
    "loggingEnabled": true
  },
  "displayName": "My Endpoint",
  "description": "My Endpoint Description",
  "model": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/base/ae8d1643-53e4-4554-be4c-221dcfb471c5"
  },
  "locale": "en-US"
}'  "https://YourServiceRegion.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints"

You should receive a response body in the following format:

{
  "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
  "model": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd"
  },
  "links": {
    "logs": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37/files/logs",
    "restInteractive": "https://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restConversation": "https://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restDictation": "https://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketInteractive": "wss://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketConversation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketDictation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37"
  },
  "project": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/projects/0198f569-cc11-4099-a0e8-9d55bc3d0c52"
  },
  "properties": {
    "loggingEnabled": true
  },
  "lastActionDateTime": "2024-07-15T16:29:36Z",
  "status": "NotStarted",
  "createdDateTime": "2024-07-15T16:29:36Z",
  "locale": "en-US",
  "displayName": "My Endpoint",
  "description": "My Endpoint Description"
}

The top-level self property in the response body is the endpoint's URI. Use this URI to get details about the endpoint's project, model, and logs. You also use this URI to update or delete the endpoint.
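
For readers who prefer a script over curl, the same Endpoints_Create request might be assembled with Python's standard library roughly as follows. This is a sketch under assumptions: build_create_endpoint_request is a hypothetical helper, the key and region are placeholders, and the request is only constructed here, not sent.

```python
import json
import urllib.request


def build_create_endpoint_request(region, key, body):
    """Hypothetical helper: build (but don't send) an Endpoints_Create request."""
    url = f"https://{region}.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints"
    # json.dumps always emits valid JSON, so a stray trailing comma can't slip in.
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        method="POST",
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
    )


request_body = {
    "project": {"self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/projects/0198f569-cc11-4099-a0e8-9d55bc3d0c52"},
    "properties": {"loggingEnabled": True},
    "displayName": "My Endpoint",
    "description": "My Endpoint Description",
    "model": {"self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd"},
    "locale": "en-US",
}

request = build_create_endpoint_request("eastus", "YourSpeechResourceKey", request_body)
print(request.get_method(), request.full_url)

# To actually send the request (requires a valid key and network access):
# with urllib.request.urlopen(request) as response:
#     created = json.load(response)
#     print(created["self"], created["status"])
```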

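Change model and redeploy endpoint

An endpoint can be updated to use another model that was created by the same Speech resource. As previously mentioned, you must update the endpoint's model before the model expires.

To use a new model and redeploy the custom endpoint:

Sign in to Speech Studio.
Select [Custom speech] > your project name > [Deploy models].
Select the link to an endpoint by name, and then select [Change model].
Select the new model that you want the endpoint to use.
Select [Done] to save and redeploy the endpoint.

To redeploy the custom endpoint with a new model, use the spx csr endpoint update command. Construct the request parameters according to the following instructions:

Set the required endpoint property to the ID of the endpoint that you want to redeploy.
Set the required model property to the ID of the model that you want deployed to the endpoint.

Here's an example Speech CLI command that redeploys the custom endpoint with a new model: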

spx csr endpoint update --api-version v3.2 --endpoint YourEndpointId --model YourModelId

You should receive a response body in the following format:

{
  "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
  "model": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd"
  },
  "links": {
    "logs": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37/files/logs",
    "restInteractive": "https://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restConversation": "https://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restDictation": "https://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketInteractive": "wss://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketConversation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketDictation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37"
  },
  "project": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/projects/0198f569-cc11-4099-a0e8-9d55bc3d0c52"
  },
  "properties": {
    "loggingEnabled": true
  },
  "lastActionDateTime": "2024-07-15T16:30:12Z",
  "status": "Succeeded",
  "createdDateTime": "2024-07-15T16:29:36Z",
  "locale": "en-US",
  "displayName": "My Endpoint",
  "description": "My Endpoint Description"
}

For Speech CLI help with endpoints, run the following command:

spx help csr endpoint

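To redeploy the custom endpoint with a new model, use the Endpoints_Update operation of the Speech to text REST API. Construct the request body according to the following instructions:

Set the model property to the URI of the model that you want deployed to the endpoint.

Make an HTTP PATCH request using the URI as shown in the following example. Replace YourSpeechResourceKey with your Speech resource key, replace YourServiceRegion with your Speech resource region, replace YourEndpointId with your endpoint ID, and set the request body properties as previously described.

curl -v -X PATCH -H "Ocp-Apim-Subscription-Key: YourSpeechResourceKey" -H "Content-Type: application/json" -d '{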
  "model": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd"
  }
}'  "https://YourServiceRegion.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/YourEndpointId"

You should receive a response body in the following format:

{
  "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
  "model": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd"
  },
  "links": {
    "logs": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37/files/logs",
    "restInteractive": "https://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restConversation": "https://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restDictation": "https://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketInteractive": "wss://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketConversation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketDictation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37"
  },
  "project": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/projects/0198f569-cc11-4099-a0e8-9d55bc3d0c52"
  },
  "properties": {
    "loggingEnabled": true
  },
  "lastActionDateTime": "2024-07-15T16:30:12Z",
  "status": "Succeeded",
  "createdDateTime": "2024-07-15T16:29:36Z",
  "locale": "en-US",
  "displayName": "My Endpoint",
  "description": "My Endpoint Description"
}

The redeployment takes several minutes to complete. In the meantime, your endpoint uses the previous model without interruption of service.
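
Because a deployment or redeployment doesn't finish immediately, a client may want to poll the endpoint until its status settles. The following Python sketch shows one way to do that. It assumes that "Succeeded" and "Failed" are the terminal status values (only "NotStarted" and "Succeeded" appear in this article), and the function wait_for_endpoint and its parameters are hypothetical. The fetch function is passed in so the loop can be tried without network access; in practice it would issue the Endpoints_Get request.

```python
import time

# Assumption: these are the states in which polling can stop.
TERMINAL_STATES = {"Succeeded", "Failed"}


def wait_for_endpoint(fetch_endpoint, interval_seconds=30.0,
                      timeout_seconds=1800.0, sleep=time.sleep):
    """Call fetch_endpoint() until the endpoint reaches a terminal status.

    The default timeout of 1800 seconds mirrors the 30-minute upper bound
    mentioned for instantiating a new endpoint.
    """
    waited = 0.0
    while True:
        endpoint = fetch_endpoint()
        if endpoint.get("status") in TERMINAL_STATES:
            return endpoint
        if waited >= timeout_seconds:
            raise TimeoutError("Endpoint did not reach a terminal status in time")
        sleep(interval_seconds)
        waited += interval_seconds


# Simulated sequence of responses standing in for real Endpoints_Get calls.
fake_responses = iter([
    {"status": "NotStarted"},
    {"status": "Running"},
    {"status": "Succeeded"},
])

final = wait_for_endpoint(lambda: next(fake_responses), sleep=lambda seconds: None)
print("Final status:", final["status"])
```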

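View logging data

Logging data is available for export if you configured it while creating the endpoint.

To download the endpoint logs:

Sign in to Speech Studio.
Select [Custom speech] > your project name > [Deploy models].
Select the link by endpoint name.
Under [Content logging], select [Download log].

To get logs for an endpoint, use the spx csr endpoint list command. Construct the request parameters according to the following instructions:

Set the required endpoint property to the ID of the endpoint whose logs you want to get.

Here's an example Speech CLI command that gets logs for an endpoint: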

spx csr endpoint list --api-version v3.2 --endpoint YourEndpointId

The location of each log file, with more details, is returned in the response body.

To get logs for an endpoint, start by using the Endpoints_Get operation of the Speech to text REST API.

Make an HTTP GET request using the URI as shown in the following example. Replace YourEndpointId with your endpoint ID, replace YourSpeechResourceKey with your Speech resource key, and replace YourServiceRegion with your Speech resource region.

curl -v -X GET "https://YourServiceRegion.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/YourEndpointId" -H "Ocp-Apim-Subscription-Key: YourSpeechResourceKey"

You should receive a response body in the following format:

{
  "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
  "model": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/models/9e240dc1-3d2d-4ac9-98ec-1be05ba0e9dd"
  },
  "links": {
    "logs": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37/files/logs",
    "restInteractive": "https://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restConversation": "https://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "restDictation": "https://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketInteractive": "wss://eastus.stt.speech.microsoft.com/speech/recognition/interactive/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketConversation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "webSocketDictation": "wss://eastus.stt.speech.microsoft.com/speech/recognition/dictation/cognitiveservices/v1?cid=a07164e8-22d1-4eb7-aa31-bf6bb1097f37"
  },
  "project": {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/projects/0198f569-cc11-4099-a0e8-9d55bc3d0c52"
  },
  "properties": {
    "loggingEnabled": true
  },
  "lastActionDateTime": "2024-07-15T16:30:12Z",
  "status": "Succeeded",
  "createdDateTime": "2024-07-15T16:29:36Z",
  "locale": "en-US",
  "displayName": "My Endpoint",
  "description": "My Endpoint Description"
}

Make an HTTP GET request using the "logs" URI from the previous response body. Replace YourEndpointId with your endpoint ID, replace YourSpeechResourceKey with your Speech resource key, and replace YourServiceRegion with your Speech resource region.

curl -v -X GET "https://YourServiceRegion.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/YourEndpointId/files/logs" -H "Ocp-Apim-Subscription-Key: YourSpeechResourceKey"

The location of each log file, with more details, is returned in the response body.
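
The two-step flow above (get the endpoint, then follow its "logs" link) could be scripted along these lines. This is a minimal sketch: build_logs_request is a hypothetical helper, the endpoint dictionary is an abbreviated copy of the sample response, and the request is built but not sent.

```python
import urllib.request


def build_logs_request(endpoint, key):
    """Hypothetical helper: build a GET request for the endpoint's "logs" link."""
    # Logs can only be exported if logging was turned on for the endpoint.
    if not endpoint.get("properties", {}).get("loggingEnabled", False):
        raise ValueError("Logging isn't enabled for this endpoint")
    logs_uri = endpoint["links"]["logs"]
    return urllib.request.Request(
        logs_uri,
        method="GET",
        headers={"Ocp-Apim-Subscription-Key": key},
    )


# Abbreviated copy of the Endpoints_Get sample response from this article.
endpoint = {
    "self": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37",
    "links": {
        "logs": "https://eastus.api.cognitive.microsoft.com/speechtotext/v3.2/endpoints/a07164e8-22d1-4eb7-aa31-bf6bb1097f37/files/logs"
    },
    "properties": {"loggingEnabled": True},
}

logs_request = build_logs_request(endpoint, "YourSpeechResourceKey")
print(logs_request.get_method(), logs_request.full_url)

# To actually fetch the list of log files (requires a valid key and network access):
# import json
# with urllib.request.urlopen(logs_request) as response:
#     print(json.load(response))
```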

Logging data is available on Microsoft-owned storage for 30 days, and then it's removed. If your own storage account is linked to the Azure AI services subscription, the logging data isn't automatically deleted.
