Azure AI Speech

0 answers

How to set up Speech SDK with MAS (AEC) in Unity

Hello everyone, I am trying to implement Acoustic Echo Cancellation (AEC) in a Unity project using the Azure Speech SDK and the Microsoft Audio Stack (MAS), but I cannot get it to work correctly. The speech recognizer continues to pick up and transcribe…

asked

nk 0

1 answer

Struggling to calculate costs for a few services for Avatar AI

Hello all. Look at this sample code, as it kind of summarizes what I want to do: https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/python/web/avatar/README.md Basically, an AI avatar that will use a custom prompt, and a…

asked

It is VMS 100

commented

Pavankumar Purilla 8,665 Microsoft External Staff Moderator

1 answer

Video Translation is failing, both in API and on Speech Studio

Video translation worked last month, but seems to have died. It seems like an Azure problem, as the issue can be reproduced in Speech Studio, thereby ruling out my code (which previously worked). The translation record shows success, but the iteration record…

asked

Paul Rony 5

commented

PANKAJ, Pranit 0

1 answer

Azure Cognitive Services text to speech integration from Genesys

I'm trying to add Azure Cognitive Services text to speech from Genesys Cloud and receive this error: "Failed to validate microfost-azure-cognitive-services-text-to-speech integration. Unexpected error." Not sure if it's because of my Azure account or what could…

asked

Dario Torres 0

commented

Prashanth Veeragoni 5,645 Microsoft External Staff Moderator

0 answers

Can we get a confidence score for the auto-detected source language using AutoDetectSourceLanguageConfig?

I am working on Speech to Text services in Azure. We're using the auto-detect feature with both a specified set and an open range, using the class below. Is there a way that I can get the confidence score for the detected source language, so that only if the score…

asked

Midhilesh Momidi 0

edited a comment

Manas Mohanty 6,455 Microsoft External Staff Moderator

0 answers

Missing Azure TTS voices with Genesys Cloud CX Connector

The Genesys Cloud CX contact center platform has a Microsoft Azure TTS connector available in their AppFoundry which we are using – the connector itself is provided by Microsoft and is called “Microsoft Azure Cognitive Services Text To Speech”. We have…

asked

Rich Bartolucci 0

2 answers

Azure Speech SDK - Formal list of Languages/Locales Supported for Semantic Speech segmentation

Per https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-csharp Semantic segmentation isn't available for all languages and locales. Can Microsoft provide a list of languages/locales for…

asked

VS 0

edited a comment

Prashanth Veeragoni 5,645 Microsoft External Staff Moderator

0 answers

Azure Real-Time diarization

Hi! I am working on a project in Python, in which I use Azure AI Speech Service. More specifically, I implemented real-time diarization using the azure.cognitiveservices.speech.transcription.ConversationTranscriber class. And now I am working on speaker…

asked

Karyna Khinevich 0

1 answer

Cannot run SPX under dotnet 8 for mac arm64 version

I have followed instructions to install dotnet 8 and the Azure speech CLI (from: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/spx-basics?tabs=windowsinstall%2Cterminal) and can verify my dotnet installation, and I have updated the…

asked

Noah Scott 0

answered

Amira Bedhiafi 34,731 Volunteer Moderator

2 answers

Unable to estimate avatar usage

We are not able to correctly understand the usage of Avatar resources on our Azure resources.

asked

Alessandro Brizzolesi 40

commented

Manas Mohanty 6,455 Microsoft External Staff Moderator

1 answer

Custom Speech Dataset

So I'm experimenting with a project and have compiled my dataset. The first step is Azure Speech Services, TTS > Custom Speech. I wanted to upload my dataset to my project; it contains both audio and transcripts. I was the one who created the…

asked

Sharmaine Erika Delgado 25

commented

Sharmaine Erika Delgado 25

1 answer

Azure AI Speech Recognition Batch Transcription Services are Down

Hello Team, it has been brought to our attention by our users that Azure AI Speech Recognition Batch Transcription Services have been down for the past 8 hours (beginning 1:30 AM UTC). Can you check and restore normal service at the earliest?

asked

Panini Devs 0

answered

Amira Bedhiafi 34,731 Volunteer Moderator

2 answers

Latency in Azure Speech service

Latency in the Azure Speech service is causing our speech to text job to fail.

asked

Apoorv Kumar 0

commented

Ravada Shivaprasad 550 Microsoft External Staff Moderator

1 answer

Intermittent authentication errors using batch transcription

Hello, I am facing intermittent container authentication issues when transcribing audio with the batch transcription API. The Speech resource has Storage Blob Data Reader permission on the container where the audio files are stored. Yesterday, I tried…

asked

Yan Gonçalves 0

commented

Pavankumar Purilla 8,665 Microsoft External Staff Moderator

1 answer

30 secs timeout on Azure speech to text

Hello, I'm experiencing an issue with Azure Speech-to-Text where, in continuous recognition mode, it outputs a RECOGNIZED result every 30 seconds, regardless of whether speech has stopped. Adjusting settings like Speech_SegmentationSilenceTimeoutMs has…

asked

Nandhu TS 0

commented

Ravada Shivaprasad 550 Microsoft External Staff Moderator

1 answer

Azure Speech SDK JavaScript - Silence timeout properties not working for continuous recognition

I'm using the Azure Speech SDK for JavaScript (microsoft-cognitiveservices-speech-sdk) to implement continuous speech recognition, but I'm unable to increase the silence timeout duration. The recognition still stops after the default silence period (~2-3…

asked

MI Sajid 0

answered

Amira Bedhiafi 34,731 Volunteer Moderator

1 answer

Translation application - the synthesized audio text may not perfectly match the original video timing, TTS speed mismatch

Hi Team, I’m developing a language translation application that generates translated video files using Azure Text-to-Speech and .NET (C#). The workflow involves generating audio from translated text and combining it with video visuals. However, I’ve…

asked

Aravind R 0

commented

Aravind R 0

1 answer

Does Azure Pronunciation Assessment handle Hong Kong, Japanese, and other East Asian English accents accurately?

We’re building a language learning app for English speakers in Hong Kong, Japan, and other East Asian countries. We plan to integrate Azure Speech Service — Pronunciation Assessment using PHP (Laravel). My main question is: How well does it handle…

asked

Darsh Al 20

accepted

Darsh Al 20

0 answers

[Setting up STT Resource]: Configure your account

I am new to Azure and have been trying to create a resource for STT. However, I am stuck in the subscription process, where it seems I must "configure my account". The subscription wouldn't be marked as complete until I do so, and to do so, there…

asked

nimesh.s 0

commented

Pavankumar Purilla 8,665 Microsoft External Staff Moderator

1 answer

Why might the Basic Custom Keyword model be taking over 9 hours to complete for 1 word with 1 prefix?

Hey folks, I have created 1 custom keyword with the word "Hey" as a prefix. There are 2 distinct pronunciations chosen for the word and the prefix and the Model Type is "Basic". In the display it mentions that the model may take…

asked

Shreyas Chitransh 20

accepted

Shreyas Chitransh 20

2,079 questions with Azure AI Speech tags
