Generate audio file from speech

Albert Duran 6 Reputation points


I am currently working on an app that uses both Speech To Text and Text To Speech. When we ask the user a question, we use Speech To Text to get the answer as text, but we also need an audio file (.mp3 or .wav) of that spoken answer. Do you know how I can achieve this?

Thank you!


Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

2 answers

  1. Oxueillirep 131 Reputation points

    You can find it in the official documentation: , which also covers other programming languages.




  2. Albert Duran 6 Reputation points

    Thanks for your reply. I've checked the link and tried to implement it, but it does not seem to work for me.

    What I need is to save the user's recorded voice to an audio file (.wav, for example). If I am not mistaken, SpeechSynthesizer only takes a string as input (SpeakTextAsync/SpeakSsmlAsync), so it produces audio from text rather than capturing what the user said. Let me know if I am wrong; if I'm not, I would truly appreciate any help with this.
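
One possible approach to the requirement described above, offered as a sketch rather than a confirmed solution: capture the microphone audio in the app itself, save it as a .wav file, and run recognition on that saved audio, so the file and the transcript come from the same recording. The Python example below only covers the "save to .wav" step, using the standard library. The names `save_pcm_as_wav` and `fake_microphone_capture` are hypothetical, and the sine tone stands in for real microphone samples, which would come from whatever recording API the app already uses. The 16 kHz, 16-bit, mono format is an assumption chosen because it is a common speech recognition input format.

```python
import math
import struct
import wave


def save_pcm_as_wav(path, pcm_bytes, sample_rate=16000, channels=1, sample_width=2):
    """Write raw little-endian PCM bytes to a .wav file (hypothetical helper)."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)  # bytes per sample; 2 means 16-bit
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm_bytes)


def fake_microphone_capture(seconds=1.0, sample_rate=16000, frequency=440.0):
    """Stand-in for real microphone capture (assumption): returns a sine tone
    as 16-bit mono PCM bytes."""
    total_samples = int(seconds * sample_rate)
    samples = (
        int(0.3 * 32767 * math.sin(2 * math.pi * frequency * n / sample_rate))
        for n in range(total_samples)
    )
    return b"".join(struct.pack("<h", sample) for sample in samples)


if __name__ == "__main__":
    # In a real app, the bytes would be the user's recorded answer.
    save_pcm_as_wav("answer.wav", fake_microphone_capture())
```

With the Python Speech SDK, a saved file like this could then be used as the recognition input through `speechsdk.audio.AudioConfig(filename="answer.wav")` in place of the default microphone. Other SDK languages offer an equivalent file-based audio configuration, but the exact call differs per language.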