1,392 questions with Azure AI Speech tags

1 answer

Custom list phrase / vocabulary on batch transcriptions?

Hi, I need the ability to provide a custom list of phrases for every transcription depending on the customer who will be transcribing a file. Consequently, I need something like this …

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,392 questions
asked 2021-04-16T16:28:46.453+00:00
Rafael Castelo 6 Reputation points
commented 2024-04-13T14:51:39.6833333+00:00
Christopher Parsons 0 Reputation points
1 answer One of the answers was accepted by the question author.

Is it possible to implement real-time streaming and viseme events using the Microsoft SDK for NodeJS?

Hi all, I would like to know whether it is possible to implement a Microsoft SDK/NodeJS based app for text-to-speech using real-time streaming (meaning that the server/client starts playback as soon as the first chunk is received) and having access to viseme…

Azure AI Speech
asked 2024-04-11T13:29:56.0133333+00:00
Stamatis Kourtis 20 Reputation points
commented 2024-04-13T04:42:08.78+00:00
navba-MSFT 17,110 Reputation points Microsoft Employee
0 answers

Endpoint with custom model returns different result to Speech Studio

I have created a custom model in Speech Studio that uses sample text and structured text. I have uploaded some test samples into Speech Studio and have tested the model against these samples. I then deployed the custom model as an endpoint and am…

Azure AI Speech
asked 2024-04-12T05:24:14.1666667+00:00
van Boheemen, Matthew 1 Reputation point
commented 2024-04-13T04:19:25.8166667+00:00
van Boheemen, Matthew 1 Reputation point
1 answer

Detect and Select Microphone Input Device for the Azure Speech Recognition (Speech To Text) cloud service in Unity

Hello, After reading all the documentation and studying an example that used NAudio to detect and select audio input devices, I noticed that NAudio does not work properly in Unity. Also, I tried feeding a series of audio samples from Unity to Azure's…

Azure AI Speech
asked 2024-04-06T12:33:19.2233333+00:00
D4N005H 0 Reputation points
edited a comment 2024-04-12T22:54:42.9133333+00:00
D4N005H 0 Reputation points
2 answers One of the answers was accepted by the question author.

How to get speaker identification in speech translation code (using MS Cognitive Services)?

I want to perform speaker identification in speech translation code (using MS Cognitive Services) in a way similar to the speech transcription code in the following (via accessing the SpeakerId property): …

Azure AI Speech
asked 2024-04-10T22:40:40.86+00:00
Mitch Clark 20 Reputation points
accepted 2024-04-12T19:36:14.27+00:00
Mitch Clark 20 Reputation points
0 answers

How to gracefully handle error from Azure text to speech?

import azure.cognitiveservices.speech as speechsdk import os import random import sentry_sdk from app.common.constants import END_OF_STREAM from app.common.utils import TimeIt, is_debug_mode, capture_exception class AzureTTS: def __init__(self, …

Azure AI Speech
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,375 questions
asked 2024-04-05T15:48:20.68+00:00
LeetGPT 60 Reputation points
commented 2024-04-12T16:16:36.76+00:00
dupammi 6,315 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Reuse SpeechRecognizer and stream for multiple audio streams?

Hi team, is there any best practice for reusing the SpeechRecognizer to recognize streamed user audio? In our application, we know where the user starts and stops talking, so we can signal the speech recognizer accordingly. The reason I wanted to reuse…

Azure AI Speech
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
941 questions
asked 2024-04-04T15:16:16.24+00:00
LeetGPT 60 Reputation points
accepted 2024-04-12T14:53:26.2+00:00
LeetGPT 60 Reputation points
0 answers

Is it possible to change speech recognition parameters in "Recognizing" or "Recognized" handlers?

Hi, I have callbacks for the Recognizing and Recognized handlers for speech recognition, and I also use keyword recognition and continuous recognition. Is it possible to update recognition parameters in those callbacks? Use case scenario is…

Azure AI Speech
asked 2024-04-09T05:03:07.7566667+00:00
Faris Lemes 20 Reputation points
commented 2024-04-12T06:47:12.9266667+00:00
Faris Lemes 20 Reputation points
0 answers

FileNotFoundError: Could not find module 'C:\Users\ATIF ALTAF\OneDrive\Desktop\Adil\Check\.venv\lib\site-packages\azure\cognitiveservices\speech\Microsoft.CognitiveServices.Speech.core.dll' (or one of its dependencies). Try using the full path with constr

I'm trying to install the package with pip install azure-cognitiveservices-speech and use it like this: import azure.cognitiveservices.speech as speechsdk, but it gives me an error. FileNotFoundError: Could not find module 'C:\Users\ATIF…

Azure AI Speech
asked 2024-03-27T18:10:11.76+00:00
Atif Altaf 0 Reputation points
edited the question 2024-04-11T16:32:31.6266667+00:00
VasaviLankipalle-MSFT 14,181 Reputation points
1 answer One of the answers was accepted by the question author.

How to use the Azure speech to text display text format features in Python?

Hi team, I am following this link for setting ITN and punctuation: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/display-text-format?pivots=programming-language-python However I couldn't find any related code snippet or samples in…

Azure AI Speech
asked 2024-04-10T02:07:05.78+00:00
LeetGPT 60 Reputation points
commented 2024-04-11T10:08:26.0466667+00:00
dupammi 6,315 Reputation points Microsoft Vendor
1 answer

Transcribe in real time during a Twilio phone call?

Hello, I'm able to make a call from Twilio. Once the call ends, I pass the .wav file to Azure Speech to Text, but I feel it's taking a lot of time to transcribe the data. Is there any way we can transcribe during the phone call itself, or any other approach we can…

Azure AI Speech
Azure Communication Services
An Azure communication platform for deploying applications across devices and platforms.
679 questions
Azure AI services
asked 2023-10-13T12:32:25.3733333+00:00
Rakesh Indla 5 Reputation points
commented 2024-04-11T09:17:55.42+00:00
Gobillion YC S21 0 Reputation points
1 answer

Request for Support in Developing a Neural TTS System in Uzbek Language

Dear Azure Speech Studio Support Team, I hope this message finds you well. I am writing to express my keen interest in developing a neural Text-to-Speech (TTS) system utilizing Azure Speech Studio, specifically tailored for the Uzbek language. My…

Azure AI Speech
asked 2024-04-09T12:26:35.6566667+00:00
Otabek Otamurodov 0 Reputation points
commented 2024-04-11T08:23:55.5266667+00:00
santoshkc 4,100 Reputation points Microsoft Vendor
1 answer

Batch text to speech: I remember the documentation used to say that only some regions could use this API, but I can no longer find that restriction. Can all regions now call the batch text to speech API?

Batch text to speech: I remember reading in the documentation that only some regions could use this API, but now I cannot find the relevant restriction. Can all regions now call…

Azure AI Speech
asked 2024-04-06T09:47:57.9433333+00:00
佳鑫 朱 0 Reputation points
commented 2024-04-10T05:47:52.9666667+00:00
navba-MSFT 17,110 Reputation points Microsoft Employee
1 answer

Persistent Issue with Azure Text-to-Speech: Missing Initial Words in Sentences

I'm encountering a recurring issue with Azure's Text-to-Speech service, where it consistently fails to include the first few words of every sentence in the generated voice output. This problem persists regardless of the specific text being synthesized.…

Azure AI Speech
asked 2024-04-07T19:32:43.19+00:00
Rukshan 0 Reputation points
answered 2024-04-10T03:43:15.11+00:00
dupammi 6,315 Reputation points Microsoft Vendor
1 answer

Can I use the voice gallery to create my own custom voice? How do I make it, how long is the production cycle, and how much will I be charged?

Can I use the voice gallery to create my own custom voice? How do I make it, how long is the production cycle, and how much will I be charged? Please show me how to make it; I want to create my own voice!

Azure AI Speech
asked 2024-04-06T14:40:08.7566667+00:00
#LIU CHANG# 0 Reputation points
edited a comment 2024-04-10T01:31:55.6133333+00:00
dupammi 6,315 Reputation points Microsoft Vendor
0 answers

Azure text to speech word boundary event always returns zero for audio offset and duration

I have a callback connected to the word boundary event, and it was working okay until a few days ago. Now the event always returns 0 for audio offset and duration, but the audio itself is fine. I'm using azure-cognitiveservices-speech 1.36.0. Problem…

Azure AI Speech
asked 2024-04-09T06:12:39.15+00:00
Matt Ma 0 Reputation points
commented 2024-04-09T20:58:13.7666667+00:00
VasaviLankipalle-MSFT 14,181 Reputation points
0 answers

Illegal Invocation Error When Using Speech SDK in Cloudflare Workers Environment

I am encountering an Illegal invocation error when trying to use microsoft-cognitiveservices-speech-sdk within a Cloudflare Workers environment. The same code works as expected in a Node.js environment, but it fails when deployed to Cloudflare Workers. …

Azure AI Speech
asked 2024-04-09T04:47:20.11+00:00
SonBs 0 Reputation points
commented 2024-04-09T20:18:07.3833333+00:00
VasaviLankipalle-MSFT 14,181 Reputation points
0 answers

Azure Neural TTS Web Player instance and plugin for React

Hello! After reading the post Azure Neural TTS Web Player: let your website speak for itself, I recently sent an email to ttsplayer@microsoft.com requesting an Azure Neural TTS Web Player for my website. In the post, the author directed readers…

Azure AI Speech
Azure AI services
asked 2024-04-03T15:17:08.5433333+00:00
Henry Chan 0 Reputation points
edited a comment 2024-04-06T11:20:44.67+00:00
Henry Chan 0 Reputation points
1 answer

Multilingual voices pronounce the words "Hi" and "Fine" incorrectly

When I use Multilingual voices (like Emma Multilingual, Andrew Multilingual, Jenny Multilingual, ...) in English for text-to-speech, the output mispronounces the single word "Hi" or "Fine". Please help me fix it. Waiting for…

Azure AI Speech
asked 2024-04-04T01:28:18.2466667+00:00
Ngoc Thai Tran 0 Reputation points
commented 2024-04-06T01:58:14.7066667+00:00
dupammi 6,315 Reputation points Microsoft Vendor
2 answers One of the answers was accepted by the question author.

Why has my TTS suddenly become bad? Speed and punctuation aren't working properly.

This morning I tried to work on my TTS file using Brian's voice. But once I listened to the speech, the punctuation & speed weren't working properly. Also, it seems that his voice became monotone. I've tried with an already-finished project to see if…

Azure AI Speech
asked 2024-03-23T14:18:35.1833333+00:00
etienne Brassard 25 Reputation points
edited a comment 2024-04-05T18:41:25.29+00:00
Kit Chan 5 Reputation points