dedicated pool of ASR engines (100 – 200) on standby
The customer is using real-time speech transcription by using custom endpoints and customer is requesting for is a dedicated pool of ASR engines (100 – 200) on standby, specific to judiciary’s usage and not for any other customer’s usage. The customer…
iOS version is using Microsoft TTS SDK occurs an error
Hello, our iOS version is using Microsoft TTS SDK, the version is: pod 'MicrosoftCognitiveServicesSpeech-iOS', '~> 1.35.0' When calling the official demo, an error occurred, specifically: func synthesisToSpeaker() { var speechConfig:…
When will new voices support blendshape output?
Hello, we are using the text-to-speech service and are relying on blendshapes for facial animations. However, some voices do not support blendshapes and this doesn't seem to be documented. In the voices overview…
Processing customer service calls in Hebrew
How can I transcribe and extract a to-do list from phone calls to a car service company in Hebrew? I need to transcribe the call, summarize the call, create a to-do list for the salesperson, and identify any necessary business procedures that should…
message: Acoustic data import failed: Zero transcriptions could be parsed from the given input.
In the Speech Studio, I'm trying to train a custom model. I'm using this folder as the template for my zip file. This is the error I get: Number of success: 0 Number of failure: 1 Error message: [ { message: Acoustic data import…
How to fix azure cognitive speech services error 0x38
I'm making a python applications with four scripts, everything works fine in vscode, but when I use the onefile command with all necessary libaries and stuff, it doesn't work it gives me 0x38. I'm using azure's functions to turn speech into text. Here's…
Custom list phrase / vocabulary on batch transcriptions?
Hi, I need the ability to provide a custom list of phrases for every transcription depending on the customer who will be transcribing a file. Consequently, I need something like this …
Is it possible to implement using NodeJS Microsoft SDK, real-time streaming and viseme events?
Hi all, I would like to know is it possible to implement a Microsoft SDK/NodeJS based app for text-to-speech using reali-time streaming (meaning that the server/client starts playback as soon as the first chunk is received) and having access to viseme…
Endpoint with custom model returns different result to Speech Studio
I have created a custom model in Speech Studio that uses sample text and structured text. I have uploaded some test samples into Speech Studio and have tested the model against these samples. I then deployed the custom model as an endpoint and am…
Detect and Select Microphone Input Device for the Azure Speech Recognition (Speech To Text) cloud service in Unity
Hello, After reading all the documentation and studying an example that used NAudio to detect and select audio input devices, I noticed that NAudio does not work properly in Unity. Also, I tried feeding a series of audio samples from Unity to Azure's…
How to get speaker identification in speech translation code (using MS Cognitive Services)?
I want to perform speaker identification in speech translation code (using MS Cognitive Services) in a way similar to the speech transcription code in the following (via accessing the SpeakerId property): …
How to gracefully handle error from Azure text to speech?
import azure.cognitiveservices.speech as speechsdk import os import random import sentry_sdk from app.common.constants import END_OF_STREAM from app.common.utils import TimeIt, is_debug_mode, capture_exception class AzureTTS: def __init__(self, …
Reuse SpeechRecognizer and stream for multiple audio streams?
Hi team, is there any best practice on how to reuse the SpeechRecognizer for stream recognizing user audios? In our application, we know where user start talking and end talking so we can signal speech recognizer for it. The reason I wanted to reuse…
Is it possible to change speech recognition parameters in "Recognizing" or "Recognized" handlers?
Hi I'm having the callbacks for Recognizing and Recognized handlers for the speech recognition, also, I have keyword recognition and continues recognition. Is there a possibility to update recognition parameters in those callbacks? Use case scenario is…
How to use Azure Speech to text display text format features in Python?
Hi team, I am following this link for setting ITN, punctuation: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/display-text-format?pivots=programming-language-python However I couldn't find any related code snippet or samples in…
transcribe real time during twilio phone call?
Hello, I'm able to make a call from twilio, once the call ends I'm passing .wav file to azure Speech To Text, I feel it's taking a lot of time transcribing data. Is there anyway during phone call itself we can transcribe or any other approach we can…
Request for Support in Developing a Neural TTS System in Uzbek Language
Dear Azure Speech Studio Support Team, I hope this message finds you well. I am writing to express my keen interest in developing a neural Text-to-Speech (TTS) system utilizing Azure Speech Studio, specifically tailored for the Uzbek language. My…
批量文本转语音,我记得之前我看文档说只有部分地区可以使用此api,但是现在没找到相关限制了,现在所有地区都可以调用批量文本转语音的api了吗
批量文本转语音,我记得之前我看文档说只有部分地区可以使用此api,但是现在没找到相关限制了,现在所有地区都可以调用批量文本转语音的api了吗? Batch text to voice, I remember before I read the document said that only some areas can use this api, but now I did not find the relevant restrictions, now all regions can call…
Persistent Issue with Azure Text-to-Speech: Missing Initial Words in Sentences
I'm encountering a recurring issue with Azure's Text-to-Speech service, where it consistently fails to include the first few words of every sentence in the generated voice output. This problem persists regardless of the specific text being synthesized.…
Can I use voice gallery to customize my own voice? How to make it, the production cycle, and how much I charge.
Can I use voice gallery to customize my own voice? How to make it, the production cycle, and how much I charge. please show me, how to make it, i want to do my own voice !