pronunciation and speech-to-text for longer utterances - streaming and saving audio in a storage bucket at the same time

Rob P 21 Reputation points

I have a question regarding cognitive services, specifically for near real-time speech-to-text and pronunciation. What would be the best approach for streaming the audio directly from a reactjs application to Azure, getting the cognitive service response and at the same time storing the audio in a bucket? The bucket content would have to be later available for a different application.
I am trying to avoid the option of recording the user input first, then separately sending the file to the storage bucket and Azure cognitive.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,272 questions
{count} votes

1 answer

Sort by: Most helpful
  1. YutongTie-MSFT 43,076 Reputation points

    @Rob P Thanks for the question again. Do you have a chance to check on Ram's response? Please let us know if you have more questions about this issue, we are willing to help.


    0 comments No comments