1,413 questions with Azure AI Speech tags

Sort by: Updated
1 answer One of the answers was accepted by the question author.

dedicated pool of ASR engines (100 – 200) on standby

The customer is using real-time speech transcription by using custom endpoints and customer is requesting for is a dedicated pool of ASR engines (100 – 200) on standby, specific to judiciary’s usage and not for any other customer’s usage. The customer…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-15T01:33:35.2566667+00:00
Mallu Swetha (MINDTREE LIMITED) 80 Reputation points Microsoft Vendor
accepted 2024-04-16T06:16:56.9566667+00:00
Mallu Swetha (MINDTREE LIMITED) 80 Reputation points Microsoft Vendor
1 answer

iOS version is using Microsoft TTS SDK occurs an error

Hello, our iOS version is using Microsoft TTS SDK, the version is: pod 'MicrosoftCognitiveServicesSpeech-iOS', '~> 1.35.0' When calling the official demo, an error occurred, specifically: func synthesisToSpeaker() { var speechConfig:…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-16T02:08:01.0066667+00:00
newsay 25 Reputation points
answered 2024-04-16T03:16:38.12+00:00
navba-MSFT 17,365 Reputation points Microsoft Employee
1 answer

When will new voices support blendshape output?

Hello, we are using the text-to-speech service and are relying on blendshapes for facial animations. However, some voices do not support blendshapes and this doesn't seem to be documented. In the voices overview…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-05T13:26:54.0966667+00:00
Steffen Schreiber 20 Reputation points
answered 2024-04-15T20:49:52.27+00:00
Steffen Schreiber 20 Reputation points
1 answer

Processing customer service calls in Hebrew

How can I transcribe and extract a to-do list from phone calls to a car service company in Hebrew? I need to transcribe the call, summarize the call, create a to-do list for the salesperson, and identify any necessary business procedures that should…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-15T15:28:37.42+00:00
Shahar Spencer 60 Reputation points
edited an answer 2024-04-15T20:32:07.0266667+00:00
VasaviLankipalle-MSFT 14,576 Reputation points
0 answers

message: Acoustic data import failed: Zero transcriptions could be parsed from the given input.

In the Speech Studio, I'm trying to train a custom model. I'm using this folder as the template for my zip file. This is the error I get: Number of success: 0 Number of failure: 1 Error message: [ { message: Acoustic data import…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2022-07-28T17:48:00.363+00:00
Prasanna 11 Reputation points
edited a comment 2024-04-15T08:34:08.95+00:00
Akihiro Inui 0 Reputation points Microsoft Employee
1 answer

How to fix azure cognitive speech services error 0x38

I'm making a python applications with four scripts, everything works fine in vscode, but when I use the onefile command with all necessary libaries and stuff, it doesn't work it gives me 0x38. I'm using azure's functions to turn speech into text. Here's…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-13T04:22:51.9266667+00:00
Olanorw 0 Reputation points
answered 2024-04-15T03:16:51.7866667+00:00
navba-MSFT 17,365 Reputation points Microsoft Employee
1 answer

Custom list phrase / vocabulary on batch transcriptions?

Hi, I need the ability to provide a custom list of phrases for every transcription depending on the customer who will be transcribing a file. Consequently, I need something like this …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2021-04-16T16:28:46.453+00:00
Rafael Castelo 6 Reputation points
commented 2024-04-13T14:51:39.6833333+00:00
Christopher Parsons 0 Reputation points
1 answer One of the answers was accepted by the question author.

Is it possible to implement using NodeJS Microsoft SDK, real-time streaming and viseme events?

Hi all, I would like to know is it possible to implement a Microsoft SDK/NodeJS based app for text-to-speech using reali-time streaming (meaning that the server/client starts playback as soon as the first chunk is received) and having access to viseme…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-11T13:29:56.0133333+00:00
Stamatis Kourtis 20 Reputation points
commented 2024-04-13T04:42:08.78+00:00
navba-MSFT 17,365 Reputation points Microsoft Employee
0 answers

Endpoint with custom model returns different result to Speech Studio

I have created a custom model in Speech Studio that uses sample text and structured text. I have uploaded some test samples into Speech Studio and have tested the model against these samples. I then deployed the custom model as an endpoint and am…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-12T05:24:14.1666667+00:00
van Boheemen, Matthew 1 Reputation point
commented 2024-04-13T04:19:25.8166667+00:00
van Boheemen, Matthew 1 Reputation point
1 answer

Detect and Select Microphone Input Device for the Azure Speech Recognition (Speech To Text) cloud service in Unity

Hello, After reading all the documentation and studying an example that used NAudio to detect and select audio input devices, I noticed that NAudio does not work properly in Unity. Also, I tried feeding a series of audio samples from Unity to Azure's…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-06T12:33:19.2233333+00:00
D4N005H 0 Reputation points
edited a comment 2024-04-12T22:54:42.9133333+00:00
D4N005H 0 Reputation points
2 answers One of the answers was accepted by the question author.

How to get speaker identification in speech translation code (using MS Cognitive Services)?

I want to perform speaker identification in speech translation code (using MS Cognitive Services) in a way similar to the speech transcription code in the following (via accessing the SpeakerId property):                …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-10T22:40:40.86+00:00
Mitch Clark 20 Reputation points
accepted 2024-04-12T19:36:14.27+00:00
Mitch Clark 20 Reputation points
0 answers

How to gracefully handle error from Azure text to speech?

import azure.cognitiveservices.speech as speechsdk import os import random import sentry_sdk from app.common.constants import END_OF_STREAM from app.common.utils import TimeIt, is_debug_mode, capture_exception class AzureTTS: def __init__(self, …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,416 questions
asked 2024-04-05T15:48:20.68+00:00
LeetGPT 60 Reputation points
commented 2024-04-12T16:16:36.76+00:00
dupammi 7,130 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Reuse SpeechRecognizer and stream for multiple audio streams?

Hi team, is there any best practice on how to reuse the SpeechRecognizer for stream recognizing user audios? In our application, we know where user start talking and end talking so we can signal speech recognizer for it. The reason I wanted to reuse…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
Azure
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
979 questions
asked 2024-04-04T15:16:16.24+00:00
LeetGPT 60 Reputation points
accepted 2024-04-12T14:53:26.2+00:00
LeetGPT 60 Reputation points
0 answers

Is it possible to change speech recognition parameters in "Recognizing" or "Recognized" handlers?

Hi I'm having the callbacks for Recognizing and Recognized handlers for the speech recognition, also, I have keyword recognition and continues recognition. Is there a possibility to update recognition parameters in those callbacks? Use case scenario is…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-09T05:03:07.7566667+00:00
Faris Lemes 20 Reputation points
commented 2024-04-12T06:47:12.9266667+00:00
Faris Lemes 20 Reputation points
1 answer One of the answers was accepted by the question author.

How to use Azure Speech to text display text format features in Python?

Hi team, I am following this link for setting ITN, punctuation: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/display-text-format?pivots=programming-language-python However I couldn't find any related code snippet or samples in…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-10T02:07:05.78+00:00
LeetGPT 60 Reputation points
commented 2024-04-11T10:08:26.0466667+00:00
dupammi 7,130 Reputation points Microsoft Vendor
1 answer

transcribe real time during twilio phone call?

Hello, I'm able to make a call from twilio, once the call ends I'm passing .wav file to azure Speech To Text, I feel it's taking a lot of time transcribing data. Is there anyway during phone call itself we can transcribe or any other approach we can…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
Azure Communication Services
Azure Communication Services
An Azure communication platform for deploying applications across devices and platforms.
694 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,416 questions
asked 2023-10-13T12:32:25.3733333+00:00
Rakesh Indla 5 Reputation points
commented 2024-04-11T09:17:55.42+00:00
Gobillion YC S21 0 Reputation points
1 answer

Request for Support in Developing a Neural TTS System in Uzbek Language

Dear Azure Speech Studio Support Team, I hope this message finds you well. I am writing to express my keen interest in developing a neural Text-to-Speech (TTS) system utilizing Azure Speech Studio, specifically tailored for the Uzbek language. My…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-09T12:26:35.6566667+00:00
Otabek Otamurodov 0 Reputation points
commented 2024-04-11T08:23:55.5266667+00:00
santoshkc 4,435 Reputation points Microsoft Vendor
1 answer

批量文本转语音,我记得之前我看文档说只有部分地区可以使用此api,但是现在没找到相关限制了,现在所有地区都可以调用批量文本转语音的api了吗

批量文本转语音,我记得之前我看文档说只有部分地区可以使用此api,但是现在没找到相关限制了,现在所有地区都可以调用批量文本转语音的api了吗? Batch text to voice, I remember before I read the document said that only some areas can use this api, but now I did not find the relevant restrictions, now all regions can call…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-06T09:47:57.9433333+00:00
佳鑫 朱 0 Reputation points
commented 2024-04-10T05:47:52.9666667+00:00
navba-MSFT 17,365 Reputation points Microsoft Employee
1 answer

Persistent Issue with Azure Text-to-Speech: Missing Initial Words in Sentences

I'm encountering a recurring issue with Azure's Text-to-Speech service, where it consistently fails to include the first few words of every sentence in the generated voice output. This problem persists regardless of the specific text being synthesized.…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-07T19:32:43.19+00:00
Rukshan 0 Reputation points
answered 2024-04-10T03:43:15.11+00:00
dupammi 7,130 Reputation points Microsoft Vendor
1 answer

Can I use voice gallery to customize my own voice? How to make it, the production cycle, and how much I charge.

Can I use voice gallery to customize my own voice? How to make it, the production cycle, and how much I charge. please show me, how to make it, i want to do my own voice !

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,413 questions
asked 2024-04-06T14:40:08.7566667+00:00
#LIU CHANG# 0 Reputation points
edited a comment 2024-04-10T01:31:55.6133333+00:00
dupammi 7,130 Reputation points Microsoft Vendor