Add new TTS technology/project (Coqui / Piper TTS) to SAPI

Question

Add new TTS technology/project (Coqui / Piper TTS) to SAPI

ThorstenVoice 5

Hello,

originally i asked for support here but was redirected here.

I'd love to add a locally running TTS python based software (Coqui TTS and Piper TTS) to Windows SAPI system. I played around with adding new entries to the registry "...Speech_OneCore/Voices/Tokens/" and tested around that GUID "{179F3D56-1B0B-42B2-A962-59B7EF59FE1B}.

I also played around with Powershell scripts that are using SAPI which showed me the list of available voices and i was able to generate spoken audio. But i did not find any information on how i can add custom voices to SAPI.

I was unsure if this is possible in general, but it seems there are solutions for Amazon Polly or ReadSpeaker so it should be possible to add voices in another way.

In general i could create spoken audio by running a python based subprocess which takes some arguments (as the text) and returns wave data.

Can someone show me which way to go - thank you.

3 answers

Your answer

Answer 1

@ThorstenVoice Azure AI speech primarily focuses on Azure speech service. As i understand from your case, you are looking to use voices from Azure speech service that can be used with your application. This can be done using the Azure speech REST API or the SDK or the Azure speech studio. If you are looking to integrate and use the voices in your application, you can use the SDK to list and synthesize text. Here is a learn course to get started where you can create the speech resource on Azure and test the same using the speech studio.

Using the speech studio, you should be able to test the voice output and you can later integrate the TTS voice list APIs to list the voices in your application and then use the TTS API to synthesize text. You can use the documentation to try the samples with the required SDKs.

If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

ThorstenVoice 5 Reputation points

2023-12-06T18:21:36.42+00:00

Thanks for your reply, but this is not my usecase. I do not want to use any Azure AI service or RESTful API.

I want to add a Python based software for LOCAL running (without internet connection) TTS software called "Piper TTS" and add it to the available voices via SAPI system. So that these LOCAL voices can be chosen from the voices / speaker dropdown menu.

Answer 2

999 Limerence 0

hi Torsten, can you show there https://sonur.chimege.com/

ThorstenVoice 5 Reputation points

2023-12-06T18:24:17.72+00:00

Thanks for your link. I downloaded the source code and maybe i can find helpful code snippets on how to add a new speech synthesis system to SAPI5.

Answer 3

Hello Thorsten,

Thank you for bringing up this interesting topic. I realized you like this wonderful text-to-speech engine to be equipped with the Speech API version 5 interface to make using it more convenient for Windows users.

I remember a sample code from Microsoft Speech SDK explaining the implementation of SAPI5 for a sample speech engine.

Please see the below link:

https://www.microsoft.com/en-us/download/details.aspx?id=10121

You need to download this file: SpeechSDK51.exe

Also, you can find the updated documentation online:

https://learn.microsoft.com/en-us/previous-versions/windows/desktop/ms720179(v=vs.85)

Perhaps an updated sample code will also be available in the "Windows SDK":

https://developer.microsoft.com/en-us/windows/downloads/windows-sdk/

Please note using my referred documentation a developer with expertise in C programming language could implement the requested feature!

Hope this wonderful TTS engine someday supports the SAPI5 interface :)

Share via

Add new TTS technology/project (Coqui / Piper TTS) to SAPI

3 answers

Your answer