Speech to Speech Translation

Chris Hooper 1 Reputation point Microsoft Employee
2021-04-11T16:51:38.853+00:00

Is it possible to use Cognitive Services to perform speech to speech translation of a video file or streamed video? If so, where do I start as most of the documentation leads to speech to text translation.

Thanks,
Chris

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,078 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. Stu Kennedy 6 Reputation points
    2022-08-02T00:20:18.947+00:00

    what is the API for doing speech-to-speech ...

    The only way I can figure out doing it is sdk.TranslationRecognizer to go from audio to text.
    And then sdk.SpeechSynthesizer to synthesize the translated text.
    Is there a way to do it in one step?

    Also the translationRecognizer seems to give up early (at 30 seconds) or when encountering a pause in the audio.
    How do I get it to process the whole file and keep going past pauses?

    1 person found this answer helpful.

  2. YutongTie-MSFT 53,976 Reputation points Moderator
    2021-04-12T00:28:32.527+00:00

    Hello,

    Thanks for reaching out to us. There is one service call Speech Translation under Azure Speech Service https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-translation

    Above document describes the benefits and capabilities of the speech translation service, which enables real-time, multi-language speech-to-speech and speech-to-text translation of audio streams. With the Speech SDK, your applications, tools, and devices have access to source transcriptions and translation outputs for provided audio. Interim transcription and translation results are returned as speech is detected, and final results can be converted into synthesized speech.

    Hope this helps.

    Regards,
    Yutong


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.