Stopping Audio Playback Mid-Stream with Microsoft Neural TTS Service and Speech SDK

Question

MD SHAKIL KHAN 0

I'm working with the Microsoft Neural Text-to-Speech (TTS) service using the Speech SDK. I've successfully implemented audio playback, but I'm facing a challenge with controlling the playback mid-stream.

My question is: How can I implement a feature to stop the audio playback at any given moment while using the Speech SDK with the Neural TTS service?

Specifically, I'm looking for:

The appropriate method or function to call for stopping audio playback

Any considerations or best practices for handling this interruption

Examples or code snippets demonstrating how to implement this feature (in Javascript)

I appreciate any guidance or solutions you can provide. Thank you in advance for your help!

1 answer

Answer 1

Sina Salam 22,031 Volunteer Moderator

Hello MD SHAKIL KHAN,

Welcome to the Microsoft Q&A and thank you for posting your questions here.

I understand that you would like to stop audio playback mid-stream using the Microsoft Neural Text-to-Speech (TTS) service with the Speech SDK in JavaScript.

You can call the close method on the AudioConfig object to stop the audio playback in JavaScript. In the example below, startSynthesis starts the speech synthesis and stopSynthesis stops the audio playback by calling close on the audioConfig object:

const sdk = require("microsoft-cognitiveservices-speech-sdk");
// Initialize the Speech SDK
const speechConfig = sdk.SpeechConfig.fromSubscription("YourSubscriptionKey", "YourServiceRegion");
const audioConfig = sdk.AudioConfig.fromDefaultSpeakerOutput();
// Create a speech synthesizer
const synthesizer = new sdk.SpeechSynthesizer(speechConfig, audioConfig);
// Function to start speech synthesis
function startSynthesis(text) {
    synthesizer.speakTextAsync(
        text,
        result => {
            if (result.reason === sdk.ResultReason.SynthesizingAudioCompleted) {
                console.log("Synthesis completed.");
            } else {
                console.error("Synthesis failed. Error details: " + result.errorDetails);
            }
        },
        error => {
            console.error("Error during synthesis: " + error);
        }
    );
}
// Function to stop speech synthesis
function stopSynthesis() {
    audioConfig.close();
    console.log("Audio playback stopped.");
}

You can modify it to suit your application.
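As a hedged alternative for browser playback, the sketch below keeps a handle to an explicit SpeakerAudioDestination and controls playback through its public pause() and resume() methods instead of through the AudioConfig. The helper name createPlaybackController and the choice to pass the SDK namespace in as a parameter are illustrative assumptions, not part of the Speech SDK, and the exact playback behavior may differ between SDK versions.

```javascript
// Sketch only: createPlaybackController is a hypothetical helper, not an SDK API.
// `sdk` is the Speech SDK namespace (window.SpeechSDK in the browser).
function createPlaybackController(sdk, speechConfig) {
    // Keep a reference to the player so playback can be controlled later.
    const player = new sdk.SpeakerAudioDestination();
    const audioConfig = sdk.AudioConfig.fromSpeakerOutput(player);
    const synthesizer = new sdk.SpeechSynthesizer(speechConfig, audioConfig);
    let stopped = false;

    return {
        // Start synthesis; the callbacks follow the same pattern as above.
        speak(text, onResult, onError) {
            if (stopped) {
                throw new Error("Controller was stopped; create a new one.");
            }
            synthesizer.speakTextAsync(text, onResult, onError);
        },
        pause() {
            player.pause();
        },
        resume() {
            player.resume();
        },
        // Halt playback and release the synthesizer. Safe to call twice.
        stop() {
            if (stopped) {
                return;
            }
            stopped = true;
            player.pause();
            synthesizer.close();
        },
        isStopped() {
            return stopped;
        }
    };
}
```

A controller that has been stopped is not reused in this sketch; the caller would create a new one for the next utterance.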

I hope this is helpful! Do not hesitate to let me know if you have any other questions.

Please don't forget to close the thread by upvoting this response and accepting it as an answer if it is helpful.

Saideep Anchuri 9,500 Reputation points Moderator

2024-10-28T19:33:22.6033333+00:00

Hello MD SHAKIL KHAN,

Following up to see if the given response was helpful.

Thank You.
MD SHAKIL KHAN 0 Reputation points

2024-10-29T11:51:07.9833333+00:00
I have used the official CDN link for the SDK and initialized the SDK on window load:

<script src="https://cdn.jsdelivr.net/npm/microsoft-cognitiveservices-speech-sdk@latest/distrib/browser/microsoft.cognitiveservices.speech.sdk.bundle-min.js"></script>
const sdk = window.SpeechSDK;
let audioConfig = sdk.AudioConfig.fromDefaultSpeakerOutput();
let synthesizer = new sdk.SpeechSynthesizer(speechConfig, audioConfig);

The issue is that audioConfig.close() does not stop the audio playback. However, audioConfig.privDestination.pause() works and pauses my audio, and the resume() function continues playing from where it was paused.

However, if the audio is paused or stopped mid-stream, I don't want to play the remaining paused audio. I want to skip that audio and move directly to the next one.
Is there any functionality for clearing the previous audio buffer and playing the upcoming audio?

Sina Salam 22,031 Volunteer Moderator

Hello MD SHAKIL KHAN,

Thank you for your feedback, I'm glad to read about your progress.

Regarding your question and specific scenario:

But I want to know if the audio is paused or stopped in the midstream, then I don't want to play the remaining paused audio. Directly I want to skip that audio and move to the next one. Is there any functionality for clearing the previous audio buffer and playing the upcoming audio...??

With the approach below, which calls synthesizer.close() and then re-creates the synthesizer, you can stop the current audio, clear the buffer, and immediately start the next audio synthesis without any leftover data from the previous synthesis:

<script src="https://cdn.jsdelivr.net/npm/microsoft-cognitiveservices-speech-sdk@latest/distrib/browser/microsoft.cognitiveservices.speech.sdk.bundle-min.js"></script>
const sdk = window.SpeechSDK;
const speechConfig = sdk.SpeechConfig.fromSubscription("YourSubscriptionKey", "YourServiceRegion");
let audioConfig = sdk.AudioConfig.fromDefaultSpeakerOutput();
let synthesizer = new sdk.SpeechSynthesizer(speechConfig, audioConfig);
// Function to start speech synthesis
function startSynthesis(text) {
    synthesizer.speakTextAsync(
        text,
        result => {
            if (result.reason === sdk.ResultReason.SynthesizingAudioCompleted) {
                console.log("Synthesis completed.");
            } else {
                console.error("Synthesis failed. Error details: " + result.errorDetails);
            }
        },
        error => {
            console.error("Error during synthesis: " + error);
        }
    );
}
// Function to stop and clear the current synthesis
function stopAndClearSynthesis() {
    synthesizer.close();
    console.log("Audio playback stopped and buffer cleared.");
    // Reinitialize the synthesizer to be ready for the next synthesis
    synthesizer = new sdk.SpeechSynthesizer(speechConfig, audioConfig);
}
// Example usage
startSynthesis("Hello, this is a test.");
// Call stopAndClearSynthesis() to stop and clear the current audio
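
For the "skip the rest and move to the next one" scenario, one possible sketch is to give every utterance its own SpeakerAudioDestination and synthesizer, so that discarding the current pair also discards whatever audio was still buffered for it. The helper name createSkippablePlayback is hypothetical, passing the SDK namespace as a parameter is an assumption made to keep the example self-contained, and the approach is intended for browser playback rather than verified against every SDK version.

```javascript
// Sketch only: createSkippablePlayback is a hypothetical helper, not an SDK API.
// `sdk` is the Speech SDK namespace (window.SpeechSDK in the browser).
function createSkippablePlayback(sdk, speechConfig) {
    let current = null; // { player, synthesizer } for the active utterance

    // Stop the active utterance, if any, and drop its player and synthesizer.
    function discardCurrent() {
        if (!current) {
            return;
        }
        current.player.pause();      // stop the audible output
        current.synthesizer.close(); // release the synthesizer
        current = null;              // drop references to the old buffer
    }

    // Speak `text`, skipping whatever was still playing before it.
    function speak(text) {
        discardCurrent();
        // A fresh player per utterance, so no buffer is shared between them.
        const player = new sdk.SpeakerAudioDestination();
        const audioConfig = sdk.AudioConfig.fromSpeakerOutput(player);
        const synthesizer = new sdk.SpeechSynthesizer(speechConfig, audioConfig);
        current = { player, synthesizer };
        synthesizer.speakTextAsync(
            text,
            result => {
                if (result.reason !== sdk.ResultReason.SynthesizingAudioCompleted) {
                    console.error("Synthesis failed. Error details: " + result.errorDetails);
                }
            },
            error => {
                console.error("Error during synthesis: " + error);
            }
        );
    }

    return { speak: speak, stop: discardCurrent };
}
```

With this shape, calling speak("next sentence") while something is still playing skips the remainder of the old audio, and stop() alone silences playback without starting anything new.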

Saideep Anchuri 9,500 Reputation points Moderator

2024-10-29T18:04:42.02+00:00

Hello MD SHAKIL KHAN,

The response provided by @Sina Salam is helpful. If this answers your query, please retake the survey on the initial response.

Thank you.
Saideep Anchuri 9,500 Reputation points Moderator

2024-10-30T15:03:32.6666667+00:00

Hello MD SHAKIL KHAN,

We haven’t heard from you since the last response and are just checking back to see if the given response was helpful.

Thank You.
