Azure TTS offers over 100 languages and a wide range of natural human sounds. These voices are accessible via Azure's API, and you can use your preferred voice in API calls.
You can browse the Microsoft Voice Gallery to find and preview different voices. Each voice in the gallery is linked to a "Sample code" tab. By clicking on this, you can copy the voice name (for example, config.SpeechSynthesisVoiceName = "en-GB-AbbiNeural") and paste it into your application's voice_id field.
If you wish to give a voice gallery for consumers to see before choosing a voice for their voiceovers, you'll probably need to generate these samples with Azure TTS and then store them elsewhere, potentially in Azure Blob Storage. This is because, while you can access and use the voice names from Azure's Voice Gallery, you cannot directly reference or fetch pre-existing sample audios from the gallery for usage in your application.
The Azure Speech SDK or Speech Studio portal can be used to integrate these voices into your application. You can choose from pre-built neural voices or create a bespoke neural voice that is specific to your product or business. The documentation provided by Azure is a fantastic place to start when incorporating these capabilities into your apps.
Microsoft's Voice Gallery provides voice samples as well as sample code, which might be useful as a reference. Voice samples and sample code for Azure's new contextual voice models, such as "RogerNeural," can be found at the Voice Gallery. This is especially beneficial if your application has long-form text, such as paragraphs.