How to save a text-to-speech audio file with Node.js to an external GCS bucket?

Question

Rafa Torres 46

We are trying to create a text-to-speech audio file and upload it to an external storage bucket (GCS in our case).

Our problem is converting the binary data that Azure returns into a Buffer. The data seems to be corrupted: the response starts with "RIFF�~\u0004\u0000WAVEfmt...(continues)" and contains "��\u0000\u0000\u0000\u0000" in the middle. Is that a sign of corruption?

I suspect that is why the audio file we create ends up empty.

We use Node.js with JavaScript. The base API client is created with Axios:

const azureLongAudio = axios.create({
    baseURL: DEFAULTPATH + apiVersion,
    headers: {
        'Ocp-Apim-Subscription-Key': API_KEY,
        'content-type': 'application/json',
        'X-Microsoft-OutputFormat': 'riff-24khz-16bit-mono-pcm',
        'Content-Type': 'application/ssml+xml'
    }
});

Then we use the following function to call Azure, create a buffer from the returned binary data, write a WAV file, and upload it to an external bucket:

const createAudioFromText = (text) => {
    const version = '1.0';
    const language = supportedLanguages[0];
    const voiceGender = 'Male';
    const voiceName = 'es-ES-EliasNeural';
    const textContent = `
    <speak version='${version}' xml:lang='${language}'>
        <voice xml:lang='${language}' xml:gender='${voiceGender}' name='${voiceName}'>
            ${text}
        </voice>
    </speak>`;
    return new Promise(async (resolve, reject) => {
        const bucket = gcs.getBucket();
        const filename = 'test-01.wav';
        const file = bucket.file(`recordings/${filename}`);

        const { data: audioData } = await azureLongAudio.post('/', textContent).catch((err) => {
            reject(err.response);
        });

        const audioBuffer = Buffer.from(audioData);

        const writer = new wav.FileWriter(filename, {
            sampleRate: 24000, 
            channels: 1, 
            bitDepth: 16, 
            audioFormat: 1 
        });

        writer.write(audioBuffer);
        writer.end();

        writer.on('finish', async () => {
            const writeStream = file.createWriteStream({
                resumable: true,
                contentType: 'audio/wav'
            });

            writer.pipe(writeStream);

            writeStream.on('error', (err) => {
                console.error('Failed to save into GCS:', err);
                reject(err);
            });

            writeStream.on('finish', () => {
                console.log(`Saved in GCS: ${filename}`);
                resolve(audioData);
            });
        });
    });
};

I'd appreciate any response.

Ramr-msft 17,826 Reputation points

2023-09-10T11:19:56.16+00:00
@Rafa Torres Thanks for the question. When you receive the binary data from the API, create a buffer directly from it and then either write it to a file or stream it to your external storage bucket.

Here is an example of how you can create a buffer from the binary data returned by the Azure Text-to-Speech API:

const { data: audioData } = await azureLongAudio.post('/', textContent).catch((err) => {
    reject(err.response);
});
const audioBuffer = Buffer.from(audioData, 'binary');

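The replacement characters (�) in the question's output suggest the response body was decoded as text before it reached Buffer.from, at which point the original bytes can no longer be recovered. A minimal sketch of one way around that, assuming the axios instance from the question: request the raw bytes with responseType: 'arraybuffer' and check the RIFF/WAVE magic bytes before uploading. The names fetchSpeech, toAudioBuffer, and looksLikeWav are hypothetical helpers, not part of axios or the Azure API.

```javascript
// Sketch: request the audio as raw bytes instead of letting axios decode it as text.
// `client` is assumed to be an axios instance such as azureLongAudio from the question.
async function fetchSpeech(client, ssml) {
    const response = await client.post('/', ssml, {
        responseType: 'arraybuffer', // axios option: hand back the body undecoded
        headers: { 'Content-Type': 'application/ssml+xml' }
    });
    return toAudioBuffer(response.data);
}

// Normalize the binary payload (a Buffer in Node, an ArrayBuffer in browsers).
function toAudioBuffer(data) {
    if (Buffer.isBuffer(data)) return data;
    if (data instanceof ArrayBuffer) return Buffer.from(data);
    if (ArrayBuffer.isView(data)) {
        return Buffer.from(data.buffer, data.byteOffset, data.byteLength);
    }
    throw new TypeError('Expected binary data; was responseType set to "arraybuffer"?');
}

// Hypothetical sanity check: a WAV file starts with "RIFF", 4 size bytes, then "WAVE".
function looksLikeWav(buffer) {
    return buffer.length >= 12 &&
        buffer.toString('ascii', 0, 4) === 'RIFF' &&
        buffer.toString('ascii', 8, 12) === 'WAVE';
}
```

If looksLikeWav returns true, the buffer can probably be uploaded as-is. The riff-24khz-16bit-mono-pcm output format already carries a WAV header, so wrapping it again with wav.FileWriter is likely unnecessary.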
dfdree 0 Reputation points

2023-09-10T11:22:22.21+00:00
To save text-to-speech audio files with Node.js to a Google Cloud Storage (GCS) external bucket, you can use the Google Cloud Storage Node.js client library. Here are the steps to achieve this:

Set Up Google Cloud Storage:

Ensure that you have a Google Cloud Platform (GCP) project set up.

Create a GCS bucket where you want to store your audio files. Make note of the bucket name and ensure that your GCP project has the necessary permissions to write to this bucket.
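Once the bucket exists, the upload itself can be quite short. The following is a sketch only, assuming the audio is already held in a Buffer and that `bucket` is a Bucket object from @google-cloud/storage; uploadWav and the bucket name in the comment are hypothetical.

```javascript
// Sketch: upload an in-memory WAV buffer to GCS without writing a local file first.
// `bucket` is assumed to be a Bucket from @google-cloud/storage, for example:
//   const { Storage } = require('@google-cloud/storage');
//   const bucket = new Storage().bucket('my-bucket'); // hypothetical bucket name
async function uploadWav(bucket, objectName, audioBuffer) {
    // Fail early instead of silently creating an empty object in the bucket.
    if (!Buffer.isBuffer(audioBuffer) || audioBuffer.length === 0) {
        throw new Error('Refusing to upload an empty or non-binary audio payload');
    }
    const file = bucket.file(objectName);
    // File#save writes the whole buffer and resolves once the upload has finished.
    await file.save(audioBuffer, { contentType: 'audio/wav', resumable: false });
    return `gs://${bucket.name}/${objectName}`;
}
```

A call such as uploadWav(bucket, 'recordings/test-01.wav', audioBuffer) would then replace the FileWriter and pipe steps from the question. A non-resumable upload is used here on the assumption that the audio files are small; larger files may be better served by a resumable upload.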


Answer 1

To save text-to-speech audio to a file and upload it to Google Cloud Storage (GCS) using Node.js, you can follow these general steps:

Set Up Google Cloud SDK: Make sure you have the Google Cloud SDK installed and configured on your machine. Set up a GCS bucket where you want to store the audio file.

Install Required Node.js Packages: Install the necessary Node.js packages for text-to-speech and GCS interaction. You can use the @google-cloud/text-to-speech library for text-to-speech and @google-cloud/storage for interacting with GCS.
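With @google-cloud/storage installed, the upload step can also be streamed, which is closer to the createWriteStream approach in the question. This is a hedged sketch rather than a definitive implementation: streamToGcs is a hypothetical helper, and `file` is assumed to be a File object obtained from bucket.file(...).

```javascript
const { Readable } = require('node:stream');
const { pipeline } = require('node:stream/promises');

// Sketch: stream an in-memory audio buffer into a GCS object.
// `file` is assumed to be a File from @google-cloud/storage (bucket.file(name)).
async function streamToGcs(file, audioBuffer) {
    // Wrapping the buffer in an array makes Readable.from emit it as one chunk.
    const source = Readable.from([audioBuffer]);
    const sink = file.createWriteStream({
        resumable: true,
        contentType: 'audio/wav'
    });
    // pipeline resolves when the sink has finished and rejects on any stream error,
    // so no manual 'finish' or 'error' listeners are needed.
    await pipeline(source, sink);
}
```

Using pipeline keeps error handling in one place: a failed upload rejects the returned promise instead of leaving the caller waiting on a 'finish' event that never fires.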
