How can I successfully stream Azure TTS audio from my server to the client using Fetch API and PassThrough push stream?

RL_Anon 5 Reputation points

I am trying to stream Azure TTS audio from my server to the client using the Fetch API and a PassThrough push stream. I expect to receive the audio in chunks, but what I actually get is a response object with no usable data. I tried reading the response as a ReadableStream from Fetch, but when I log the response I get an error message saying that my response object has size 0. I also checked whether anything arrives in chunks, and everything is size 0. I have debugged my backend and, as far as I can tell, it is working properly. If anyone has solved this issue or has demo code for streaming TTS in JavaScript, please let me know.

This is my synthesis function. I believe it works:

const sdk = require("microsoft-cognitiveservices-speech-sdk");
const { PassThrough } = require("stream");

// Synthesizes `text` with Azure TTS and resolves with a stream of MP3 audio.
const generateSpeechFromText = async (text) => {
  const speechConfig = sdk.SpeechConfig.fromSubscription(
    process.env.SPEECH_KEY,
    process.env.SPEECH_REGION
  );
  speechConfig.speechSynthesisVoiceName = "en-US-JennyNeural";
  speechConfig.speechSynthesisOutputFormat =
    sdk.SpeechSynthesisOutputFormat.Audio16Khz32KBitRateMonoMp3;

  const synthesizer = new sdk.SpeechSynthesizer(speechConfig);

  return new Promise((resolve, reject) => {
    synthesizer.speakTextAsync(
      text,
      (result) => {
        if (result.reason === sdk.ResultReason.SynthesizingAudioCompleted) {
          // The complete audio is already in memory at this point, so the
          // stream is filled and ended in a single write.
          const bufferStream = new PassThrough();
          bufferStream.end(Buffer.from(result.audioData));
          resolve(bufferStream);
        } else {
          console.error("Speech synthesis canceled: " + result.errorDetails);
          reject(new Error("Speech synthesis failed"));
        }
        synthesizer.close();
      },
      (error) => {
        console.error("Error in speech synthesis: " + error);
        synthesizer.close();
        reject(error);
      }
    );
  });
};
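
Note that this function only resolves once synthesis has finished, so the PassThrough receives all of the audio in one write. A minimal sketch of a chunk-by-chunk variant follows, assuming the SDK's `synthesizing` event delivers each new chunk as an ArrayBuffer in `e.result.audioData`. The name `streamSpeech` is hypothetical, and the synthesizer is passed in as a parameter so the sketch does not build its own configuration.

```javascript
const { PassThrough } = require("node:stream");

// Returns a stream that receives audio chunks while synthesis is still running.
// `synthesizer` is expected to be an sdk.SpeechSynthesizer created by the caller.
function streamSpeech(synthesizer, text) {
  const audioStream = new PassThrough();

  // Assumption: every `synthesizing` event carries only the newest chunk.
  synthesizer.synthesizing = (_sender, e) => {
    audioStream.write(Buffer.from(e.result.audioData));
  };

  synthesizer.speakTextAsync(
    text,
    (result) => {
      synthesizer.close();
      if (result.errorDetails) {
        // Synthesis was canceled; surface the reason to whoever reads the stream.
        audioStream.destroy(new Error(result.errorDetails));
      } else {
        // All chunks were already written by the handler above.
        audioStream.end();
      }
    },
    (error) => {
      synthesizer.close();
      audioStream.destroy(new Error(String(error)));
    }
  );

  return audioStream;
}
```

Because the stream is returned immediately instead of through a Promise, the route could start sending bytes before synthesis completes. Errors then arrive as a stream `error` event rather than a rejected Promise, so the caller would need to handle them there.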

This is the route in my index.js that sends the audio to the frontend. I believe it works, but there could be an error:

app.get("/textToSpeech", async (request, reply) => {
  if (textWorks) {
    try {
      const stream = await generateSpeechFromText(textWorks);
      console.log("Stream created, sending to client: ", stream);
      reply.type("audio/mpeg").send(stream);
    } catch (err) {
      console.error(err);
      reply.status(500).send("Error in text-to-speech synthesis");
    }
  } else {
    reply.status(404).send("OpenAI response not found");
  }
});
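
To rule out the framework as the cause, it may help to compare against a route written with Node's built-in `http` module. The following is only a sketch under that substitution, not the route above: `createTtsServer` and `getAudioStream` are hypothetical names, and `getAudioStream` stands for any function returning a readable stream of MP3 bytes.

```javascript
const http = require("node:http");
const { pipeline } = require("node:stream");

// Builds a server whose /textToSpeech route pipes an audio stream to the client.
// `getAudioStream` is a hypothetical function returning a Readable of MP3 bytes.
function createTtsServer(getAudioStream) {
  return http.createServer((req, res) => {
    if (req.url !== "/textToSpeech") {
      res.writeHead(404).end("Not found");
      return;
    }
    // No Content-Length is set, so for HTTP/1.1 clients Node falls back to
    // chunked transfer encoding and forwards chunks as the source emits them.
    res.writeHead(200, { "Content-Type": "audio/mpeg" });
    pipeline(getAudioStream(), res, (err) => {
      if (err) console.error("Streaming to client failed:", err);
    });
  });
}
```

Requesting this route with a tool such as curl and checking the number of bytes received would show whether the server really sends audio, independently of the client code.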

This is my frontend client code. I think the error has to do with the response object, but I am not sure:

// Fetch TTS from Backend
export const fetchTTS = async (): Promise

romungi-MSFT 48,911 Reputation points Microsoft Employee Moderator

2024-01-09T08:58:54.76+00:00

@RL_Anon The JS SDK repo includes some tests for push and pull stream audio synthesis. Could you please check whether they help? Thanks!!

RL_Anon 5 Reputation points

2024-01-12T06:34:32.6466667+00:00

@romungi-MSFT

Thanks for your response. I actually went through the SDK repo previously and tested it. I believe the server is able to stream the audio just fine. The problem is that the client app cannot handle the streaming chunks, so it throws an error.
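
For anyone comparing notes on the client side: a fetch `Response` exposes its body as a stream, so the bytes only become visible once the body is actually read. A minimal sketch of reading it chunk by chunk follows. `readAudioResponse` and its `onChunk` callback are hypothetical names, and the `audio/mpeg` type assumes the MP3 output format configured on the server.

```javascript
// Reads a fetch Response body chunk by chunk and returns the audio as a Blob.
// `onChunk` is an optional, hypothetical callback invoked for every chunk.
async function readAudioResponse(response, onChunk = () => {}) {
  if (!response.ok) {
    throw new Error(`TTS request failed with status ${response.status}`);
  }
  const reader = response.body.getReader();
  const chunks = [];
  while (true) {
    // `value` is a Uint8Array holding the bytes of the current chunk.
    const { done, value } = await reader.read();
    if (done) break;
    chunks.push(value);
    onChunk(value);
  }
  return new Blob(chunks, { type: "audio/mpeg" });
}
```

In a browser this could be used as `const blob = await readAudioResponse(await fetch("/textToSpeech"));` followed by `new Audio(URL.createObjectURL(blob)).play();`. This sketch collects every chunk before playback, so it confirms that data arrives but does not start playing early.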
