Logs Not Generated for Custom Model in Azure Speech Services
Why are the logs not being generated for my custom speech model in Azure Speech Services, and how can I ensure that transcription is using the custom model? PS - Based on common issues that we have seen from customers and other sources, we are posting these questions to help the Azure community.
Training German Audio Files in Azure Custom Speech Fails
What is causing the failure when trying to train German audio files with human-made transcripts in Azure Custom Speech?
How to Monitor Speech Service Models and Understand Rate Limits in Azure
How can I monitor my Azure Speech Service models, enable alerts, and understand the rate limits for real-time transcription and other speech resources?
Ensuring Uninterrupted Speech Services During Azure Failover Scenarios
How can I ensure that speech recognition and synthesis services continue without interruption during an Azure failover scenario?
Fixing Premature Session Stopping in Azure Speech SDK for Long Audios
What should I do if I receive a "Session Stopped" message before the end of my audio file when using Azure Speech SDK?
I want to use the API
I want to use this API. When will I be able to use it? https://learn.microsoft.com/zh-cn/azure/ai-services/speech-service/fast-transcription-create
About speaker separation in "fast-transcription-api"
Dear Azure Support Team, regarding https://learn.microsoft.com/en-us/rest/api/speechtotext/transcriptions/transcribe?view=rest-speechtotext-2024-05-15-preview&tabs=HTTP: the details of the TranscribeDefinition class are not described anywhere, so how should I do…
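As a sketch of how the fast transcription request is typically assembled: the endpoint takes a multipart form with an audio part and a JSON "definition" part. The `locales` field is documented; the `diarization` block below is an assumption based on the preview documentation (its exact field names are what this question is asking about), so verify it against the current API reference before relying on it.

```python
import json

# Sketch: build the JSON "definition" part for the fast transcription API
# (POST {endpoint}/speechtotext/transcriptions:transcribe?api-version=2024-05-15-preview).
# "locales" is the documented language hint; the "diarization" shape is an
# ASSUMPTION and may differ from the actual TranscribeDefinition schema.
def build_transcribe_definition(locales, max_speakers=None):
    definition = {"locales": locales}
    if max_speakers is not None:
        # Assumed field names for speaker separation; verify against the docs.
        definition["diarization"] = {"enabled": True, "maxSpeakers": max_speakers}
    return json.dumps(definition)

# The full request is multipart/form-data: an "audio" file part plus this
# "definition" part, authenticated with an Ocp-Apim-Subscription-Key header.
definition = build_transcribe_definition(["en-US", "ja-JP"], max_speakers=2)
```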
Transcription Denormalization
Is there a way to "denormalize" Azure speech transcription, so it provides verbatim transcription (as close as possible, with word fillers, hesitations, repeats, etc)? I will also need word level timestamping and diarization. I am hoping there…
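For reference, word-level timestamps and diarization can be requested through the batch transcription REST API. The payload below is only a sketch: the property names follow the v3.x Speech to text REST API and should be verified against the current reference, and the content URL is a placeholder. Fully verbatim output (fillers, hesitations, repeats) has no documented switch; the `lexical` field of each recognized phrase in the result file is the closest, unformatted form.

```python
import json

# Sketch of a batch transcription request body
# (POST {endpoint}/speechtotext/v3.2/transcriptions).
# Property names follow the v3.x REST API; verify before use.
payload = {
    "displayName": "verbatim-style transcription",
    "locale": "en-US",
    "contentUrls": ["https://example.com/audio.wav"],  # placeholder URL
    "properties": {
        "wordLevelTimestampsEnabled": True,   # word timings in the result file
        "diarizationEnabled": True,           # speaker labels for mono audio
        "punctuationMode": "DictatedAndAutomatic",
        "profanityFilterMode": "None",        # keep words as spoken
    },
}
body = json.dumps(payload)
```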
Improving Accuracy of Azure Speech-to-Text with Continuous Language Identification
How can I improve the accuracy of language identification and speech-to-text (STT) capabilities in Azure Speech Service for my voice bot, which is experiencing issues with detecting English language and picking up background noise?
Issues with Recognizing Mixed Thai and English Audio in Azure Speech Service
How can I improve the accuracy of recognizing mixed Thai and English audio using Azure Speech Service?
Enhancing Multilingual Transcription Accuracy with Azure Speech Service
What steps can I take to improve the transcription accuracy of audio files that contain multiple languages using Azure Speech Service?
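As a sketch, batch transcription supports supplying candidate locales for automatic language identification, which also applies to the mixed Thai/English case above. The `languageIdentification` property name is taken from the v3.x REST API and should be checked against the current docs; the content URL is a placeholder.

```python
import json

# Sketch: batch transcription with automatic language identification.
# "languageIdentification.candidateLocales" follows the v3.x REST API;
# the service chooses among these locales when transcribing.
payload = {
    "displayName": "multilingual transcription",
    "locale": "en-US",  # fallback locale
    "contentUrls": ["https://example.com/mixed-language.wav"],  # placeholder
    "properties": {
        "languageIdentification": {
            "candidateLocales": ["en-US", "th-TH"]
        }
    },
}
body = json.dumps(payload)
```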
How to Associate Client-Side Live Transcription Sessions with Logged Audio Files in Azure AI Services
How can I associate client-side live transcription sessions with the logged audio files in Azure AI services?
Troubleshooting WebRTC Disconnection in Avatar Service
What could be causing the disconnection issue when trying to start an avatar using WebRTC in Azure Cognitive Services-Speech Services, and how can it be resolved?
Resolving Segmentation Fault with Azure Speech SDK and jemalloc
What steps can I take to resolve a segmentation fault when using jemalloc with the Azure Speech SDK in a Java application?
503 Error When Downloading Azure TTS License for Disconnected Containers
What should I do if I am unable to download the TTS license for Azure speech disconnected containers and encounter a 503 error code?
Resolving FetchDataError in Azure Speech Service
What steps should be taken when encountering a FetchDataError in the Azure Speech Service, causing a fatal error when accessing features?
Limitation on Text-to-Speech Audio Length in Azure Cognitive Services
How can I generate audio files longer than 10 minutes using Azure Cognitive Services' Text-to-Speech API?
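The 10-minute limit applies to a single real-time synthesis request; longer audio is usually produced with the batch synthesis REST API instead. The payload below is a sketch: field names (`inputKind`, `inputs`, `synthesisConfig`) follow the batch synthesis API but should be verified against the current reference, and the voice name is just an example.

```python
import json

# Sketch of a batch synthesis request body
# (PUT {endpoint}/texttospeech/batchsyntheses/{job-id}?api-version=...).
# Field names follow the batch synthesis REST API; verify before use.
payload = {
    "inputKind": "PlainText",
    "inputs": [
        {"content": "A long script that would exceed the real-time limit..."}
    ],
    "synthesisConfig": {
        "voice": "en-US-JennyNeural"  # example voice name
    },
}
body = json.dumps(payload)
```

The job runs asynchronously; the result is polled via GET on the same job URL and downloaded when it reaches a succeeded state.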
Unable to Save Lexicon in Azure Cognitive Services
Why am I unable to save a lexicon in Azure Cognitive Services, and how can I resolve this issue?
Do you have any suggestions or assistance for using the speech-to-text function to recognize homophones that may cause errors?
For example, in Chinese, "枯" (kū) is recognized as "哭" (kū). The surrounding context cannot disambiguate them, so this is just a probabilistic issue.
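Phrase lists and custom speech training are the service-side options for biasing recognition. As a lightweight client-side mitigation (a plain substitution pass, not a Speech service feature), known homophone confusions can be corrected after transcription; the mapping below is a hypothetical example for 枯/哭-style errors.

```python
# Sketch: client-side post-processing for known homophone confusions.
# This is plain text substitution, not a Speech service feature; the
# mapping is a hypothetical example and should be built from observed errors.
CORRECTIONS = {
    "哭木逢春": "枯木逢春",  # hypothetical misrecognition -> intended phrase
}

def fix_homophones(text: str) -> str:
    """Replace each known misrecognized phrase with the intended one."""
    for wrong, right in CORRECTIONS.items():
        text = text.replace(wrong, right)
    return text
```

Matching multi-character phrases rather than single characters keeps the substitution from corrupting legitimate uses of the homophone.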
How can text-to-speech read English words aloud syllable by syllable? The purpose is to make videos for memorizing English words.
The voices seem to be optimized for reading complete sentences, but they cannot read words out syllable by syllable.
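One workaround is to insert SSML breaks between manually hyphenated syllables so the voice pauses between them. The snippet below builds such an SSML document; the voice name is only an example, and the syllable split is supplied by the caller, since neural voices have no documented "read by syllable" mode.

```python
# Sketch: build SSML that pauses between caller-supplied syllables, as a
# workaround for voices with no syllable-by-syllable reading mode.
def syllable_ssml(syllables, voice="en-US-JennyNeural", pause_ms=400):
    """Join syllables with SSML <break> pauses inside a <voice> element."""
    spoken = f'<break time="{pause_ms}ms"/>'.join(syllables)
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xml:lang="en-US">'
        f'<voice name="{voice}">{spoken}</voice>'
        "</speak>"
    )

ssml = syllable_ssml(["in", "ter", "est", "ing"])
```

Passing the resulting string to a speak-SSML call (rather than plain text) makes the service honor the pauses; `<phoneme>` tags can be layered in if individual syllables are mispronounced.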