Generate audio file from speech

Albert Duran 6 Reputation points
2022-09-30T11:53:05.697+00:00

Hi,

I am currently working on an app which uses both: Speech To Text and Text To Speech. When we ask for something to the user, we are using Speech To Text to get the answer as a text but we need also to get an audio file (.mp3 or .wav) of that answer. Do you know how can I reach this goal?

Thank you!

Albert

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,645 questions
0 comments No comments
{count} vote

2 answers

Sort by: Most helpful
  1. Oxueillirep 131 Reputation points
    2022-10-02T11:11:27.643+00:00

    Hi,
    you can find it from the official documentation: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/how-to-speech-synthesis?tabs=browserjs%2Cterminal&pivots=programming-language-python#synthesize-speech-to-a-file , also for other programming languages.

    regards

    If an answer is helpful, please click on 130616-image.png or upvote 130671-image.png which might help other community members reading this thread.

    1 person found this answer helpful.
    0 comments No comments

  2. Albert Duran 6 Reputation points
    2022-10-03T07:42:37.647+00:00

    Thanks for your reply, I've checked the link and tried to implement it but it does not seem to work for me.

    What I need is to get the recorded voice in an audio file (like .wav for example) but, if I am not wrong, what SpeechSynthesizer is asking for is just a string (SpeakTextAsync/SpeakSsmlAsync). Let me know if I am wrong and If i'm not, I will truly appreciate any help with this.