Share via

Automating Training for Custom Speech Model

Sharvani, Popuri Sai 21 Reputation points
2021-07-07T13:49:18.977+00:00

I am working on a requirement where I have created a custom speech model for Azure Chatbot. I have uploaded some audio files in https://speech.microsoft.com , created a model and then deployed it manually for getting model id and speech recognition endpoint id. I have done all these steps manually in speech studio, but I want to know if there is any way to automate this process for updating the files.

I have used swagger API https://westus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/CreateModel but when I create datasets and models using this API, I will have to click on "Connect" in speech studio every time I create anything new as below. So, I want to know if adding new files, creating new models and deploying models can be automated with out manually clicking Connect as below, because unless I click Connect the new files are not reflecting in the portal.

112644-image.png

Azure Speech in Foundry Tools

Answer accepted by question author

YutongTie-9091 54,026 Reputation points Moderator
2021-07-16T09:49:34.257+00:00

@Sharvani, Popuri Sai
I have done some research about this issue. Assigning models, datasets. etc. to a project is optional. Customer is welcome to use Speech-to-Text REST API to completely automate the training process. This is API Reference https://centralus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/CopyModelToSubscription

If you would like to keep association with a Speech Studio project,. this is possible also. (I have confirmed that there is a backlog to update Swaggger doc on this).

This is how it's done:

an example of the request body for Create Dataset request (https://centralus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-0/operations/CreateDataset) with the project reference.

{

           "kind": "Acoustic",  

           "contentUrl": "https://contoso.com/location. May point to any publicly readable location: Azure blob, AWS S3, etc.",  

           "locale": "en-US",  

           "displayName": "My speech dataset name",  

           "description": "My speech dataset description",  

           "project": {  

                          "self": "https://westeurope.api.cognitive.microsoft.com/speechtotext/v3.0/projects/c1c643ae-7da5-4e38-9853-e*********2"  

           }  

Hope this helps.

Regards,
Yutong
}

Was this answer helpful?


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.