1,435 questions with Azure AI Speech tags

0 answers

Multilingual voice mispronounces Ukrainian as Russian

How can I resolve the issue of multilingual voices pronouncing Ukrainian as Russian when using Text to Speech with the Microsoft.CognitiveServices.Speech package in C#? Explicitly specifying the language in the code through the SpeechSynthesisLanguage…

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,435 questions
asked 2024-05-19T06:44:50.0966667+00:00
Serhii Kapin 0 Reputation points
0 answers

Phonemes are not available for pronunciation recognition in French

In the pronunciation recognition result, if we set the language to "en-US", we get all the results for the phonemes spoken/matched, as below. "Phonemes": [ { "Phoneme":…

asked 2024-05-18T21:38:28.0266667+00:00
GOMES-ALVES-DOS-SANTOS Bruna 0 Reputation points
0 answers

Markdown to SSML?

Does anyone know of a basic "preparer-converter" that takes a markdown (.md) file and converts it into an SSML file?
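
A minimal sketch of such a converter, using only the Python standard library. It handles a small Markdown subset (headings, paragraphs, `**bold**`, `*italic*`) and nothing else. The function names and the default voice `en-US-JennyNeural` are assumptions chosen for illustration, and not every voice honors `<emphasis>`, so treat the tag mapping as a starting point rather than a complete solution.

```python
import re
from xml.sax.saxutils import escape


def _inline(text: str) -> str:
    """Escape XML special characters, then map bold/italic to <emphasis>."""
    text = escape(text)
    text = re.sub(r"\*\*(.+?)\*\*", r'<emphasis level="strong">\1</emphasis>', text)
    text = re.sub(r"\*(.+?)\*", r"<emphasis>\1</emphasis>", text)
    return text


def markdown_to_ssml(markdown: str, voice: str = "en-US-JennyNeural") -> str:
    """Convert a small Markdown subset to an SSML document (hypothetical helper)."""
    parts = []
    # Blank lines separate blocks (headings or paragraphs).
    for block in re.split(r"\n\s*\n", markdown.strip()):
        text = " ".join(line.strip() for line in block.splitlines())
        heading = re.match(r"#{1,6}\s+(.*)", text)
        if heading:
            # Speak the heading as its own paragraph, followed by a pause.
            parts.append(f'<p>{_inline(heading.group(1))}</p><break time="500ms"/>')
        else:
            parts.append(f"<p>{_inline(text)}</p>")
    body = "\n".join(parts)
    return (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xml:lang="en-US">\n'
        f'<voice name="{voice}">\n{body}\n</voice>\n</speak>'
    )
```

Lists, links, tables, and code blocks are passed through as plain text here; a real converter would need a proper Markdown parser for those.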

asked 2024-05-18T19:23:52.8433333+00:00
ivo welch 20 Reputation points
0 answers

Speech Studio Audio Content Creation (x) Content Format and Audio Export Fail

I discovered https://speech.microsoft.com/portal, audio creation tile. (I think it should be the first one and described as "interactive batch TTS web interface.") I uploaded a file named test.txt, which has two paragraphs. For decades now,…

asked 2024-05-18T19:20:53.55+00:00
ivo welch 20 Reputation points
1 answer One of the answers was accepted by the question author.

Batch TTS with REST: YourSynthesisId and other intro questions

I got the REST API to work on macOS. Yeah!!! I could hear the output from the sample code. Alas, now I would like to submit a longer document I wrote to batch TTS and post it as my podcast. I am taking the example right off the webpage, and just…

asked 2024-05-16T21:32:09.3333333+00:00
ivo welch 20 Reputation points
accepted 2024-05-18T19:09:24.1933333+00:00
ivo welch 20 Reputation points
0 answers

Cognitive services pronunciation assessment always gives 100% score, even with badly pronounced words

I built a Svelte (JavaScript) application that uses the Microsoft Speech SDK (v1.36), and I am using it to evaluate pronunciation in 3 languages: English, German and French. Initially I was using RecognizeOnceAsync() which waits for silence at the end of…

asked 2024-05-17T10:49:24.3766667+00:00
Schoolblocks 0 Reputation points
commented 2024-05-17T20:50:29.52+00:00
VasaviLankipalle-MSFT 14,831 Reputation points
0 answers

Azure pronunciation assessment time limit

I am using Azure pronunciation assessment to assess an audio file, but the assessment only covers the first minute of the speech and doesn't assess the rest of the audio. This is my code: const sdk =…

asked 2024-05-17T11:36:28.55+00:00
Iheb Jandoubi 5 Reputation points
commented 2024-05-17T18:28:20.5166667+00:00
romungi-MSFT 42,761 Reputation points Microsoft Employee
1 answer

Can you add a phrase list to the CallMediaRecognizeSpeechOptions class when using speech-to-text Cognitive Services from Azure Communication Services?

I am using ACS to access a multi-service Cognitive Services endpoint and doing recognition of speech input in real time via ACS/telephone. I am using the default model provided by Microsoft. This is sufficient in most cases but I have some place names…

Azure AI Speech
Azure Communication Services
An Azure communication platform for deploying applications across devices and platforms.
704 questions
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,435 questions
asked 2024-05-17T08:49:02.57+00:00
John 0 Reputation points
answered 2024-05-17T14:33:27.65+00:00
romungi-MSFT 42,761 Reputation points Microsoft Employee
0 answers

Is it possible to specify in Speech SDK to always use "lbs" instead of "£" when "pounds" is recognized?

Hi, is it possible to configure the Speech SDK so that when the word "pound" is detected, it is always taken to mean lbs, not £? For example, when I say "99 pounds" it is detected as "99 lbs", but if I said, "100…
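
One possible workaround, independent of any SDK setting, is to post-process the recognized text. The sketch below assumes the unwanted form is a number with a leading "£" sign (for example "£100") and rewrites it as a weight. The function name `pounds_as_weight` is hypothetical, and this only makes sense in an application where currency amounts never occur.

```python
import re

# A number written with a leading pound sign, e.g. "£100", "£ 99.5", "£1,250".
_POUND_AMOUNT = re.compile(r"£\s?(\d+(?:,\d{3})*(?:\.\d+)?)")


def pounds_as_weight(display_text: str) -> str:
    """Rewrite '£<number>' as '<number> lbs' in recognized text."""
    return _POUND_AMOUNT.sub(r"\1 lbs", display_text)


if __name__ == "__main__":
    print(pounds_as_weight("The package weighs £100."))
```

Text that already uses "lbs" is left untouched, so the function can be applied to every recognition result.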

asked 2024-04-23T08:38:47.7566667+00:00
Faris Lemes 20 Reputation points
commented 2024-05-17T12:58:40.3666667+00:00
Faris Lemes 20 Reputation points
1 answer

I cannot find this: "To create a custom avatar endpoint, follow these steps: Sign in to Speech Studio. Navigate to Custom Avatar > Your project name > Train model."

I cannot find the custom avatar key after signing in to Speech Studio.

asked 2024-05-16T11:23:22.97+00:00
Praveen Jaganivasan 0 Reputation points
commented 2024-05-17T12:01:04.6633333+00:00
santoshkc 4,925 Reputation points Microsoft Vendor
1 answer

How to use a Microsoft Entra ID to authenticate with the Speech to text REST API (for batch transcription)

It looks like you can only authenticate to the "Speech to text REST API" with an API key (Ocp-Apim-Subscription-Key). What we would like is to authenticate with a Microsoft Entra ID. Why? Our application is running on AKS and all our containers…

asked 2024-05-10T13:45:07.2666667+00:00
Johan Klijn 41 Reputation points
commented 2024-05-17T12:00:35.7433333+00:00
navba-MSFT 17,395 Reputation points Microsoft Employee
1 answer

Issue with speech-to-text service

While converting the given wave file with Microsoft's Speech-to-Text service, it does not detect "No" at the 57th second of this file, but it does detect it at 1:12 min and in other places. The recognized speech is as follows: RECOGNIZED:…

asked 2024-05-16T06:18:15.6466667+00:00
Vidyadhar Busam 0 Reputation points
commented 2024-05-17T11:40:44.1833333+00:00
santoshkc 4,925 Reputation points Microsoft Vendor
1 answer

How to output transcription at the word level

With the provided callback function, the text is outputted as described by you, either after a short pause or after a maximum of 15 seconds. Is it possible to output word by word so that the text can be seen while speaking? def…

asked 2024-05-17T08:41:50.08+00:00
Sophie 0 Reputation points
answered 2024-05-17T09:11:07.3166667+00:00
Gowtham CP 1,010 Reputation points
1 answer One of the answers was accepted by the question author.

Set sound threshold for Microsoft speech-to-text

Hi, is it possible to set a volume threshold for the speech that gets transcribed, such that speech below a certain threshold would not get transcribed? I am using the Speech SDK. Br, Daniel
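
One way to approximate a volume threshold on the client side is to measure the loudness of each audio chunk and only forward chunks above a cutoff to the recognizer (for example through a push-style audio stream). The sketch below makes no claim about a built-in SDK setting. It assumes 16-bit little-endian mono PCM chunks, and the function names and the default threshold of 500 are illustrative values that would need tuning.

```python
import array
import math
import sys


def rms_16bit(pcm: bytes) -> float:
    """Root-mean-square amplitude of 16-bit little-endian PCM audio."""
    samples = array.array("h")
    # Drop a trailing odd byte so the buffer holds whole samples only.
    samples.frombytes(pcm[: len(pcm) - len(pcm) % 2])
    if sys.byteorder == "big":
        samples.byteswap()  # input is little-endian; convert on big-endian hosts
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def is_loud_enough(pcm: bytes, threshold: float = 500.0) -> bool:
    """True if the chunk's RMS amplitude reaches the (assumed) threshold."""
    return rms_16bit(pcm) >= threshold
```

Dropping quiet chunks entirely changes the timing of the audio the recognizer sees; replacing them with silence of the same length is a gentler variant.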

asked 2023-05-19T11:26:14.9833333+00:00
Daniel Beck Hansen 21 Reputation points
commented 2024-05-17T07:51:10.4066667+00:00
Amila Hapuarachchi 0 Reputation points
0 answers

Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS

Subject: Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS Description: The Azure Neural TTS system is mispronouncing the Welsh contraction "i’w." Instead of producing the correct pronunciation…

asked 2024-05-16T14:22:18.8166667+00:00
Verbari LLC 0 Reputation points
commented 2024-05-17T05:43:08.5666667+00:00
navba-MSFT 17,395 Reputation points Microsoft Employee
0 answers

Inquiry Regarding Azure AI Speech Error

Dear Azure Support Team, I recently encountered an issue while using Azure AI Speech service with recordings from the VoiceMemo app on iPhone. Specifically, when attempting to process recordings of approximately 30 minutes in length, I received the…

asked 2024-05-15T12:18:25.3266667+00:00
y.ashibe 0 Reputation points
edited a comment 2024-05-17T05:41:30.01+00:00
navba-MSFT 17,395 Reputation points Microsoft Employee
1 answer

macOS CLI starter guide

I am trying to play around with Azure text to speech on macOS. The instructions are woefully incomplete. I start with…

asked 2024-05-15T22:32:07.6566667+00:00
Welch, Ivo 0 Reputation points
answered 2024-05-16T20:16:17.4966667+00:00
ivo welch 20 Reputation points
1 answer

Azure AI - Speech Studio - Error Message

Hi there, I received this error message today. "为资源 xiaoshuoyuedu1 分配的角色尚未生效。 请让资源管理员配置__自定义子域__并启用 VNet 以使你的角色正常工作。" "The role assigned to resource xiaoshuoyuedu1 has not taken effect yet. Please have the resource administrator configure…

asked 2024-05-15T02:43:48.61+00:00
Harb369 5 Reputation points
edited an answer 2024-05-16T17:53:45.07+00:00
romungi-MSFT 42,761 Reputation points Microsoft Employee
3 answers One of the answers was accepted by the question author.

Why has my TTS suddenly become bad? Speed & punctuation aren't working properly.

This morning I tried to work on my TTS file using Brian's voice. But once I listened to the speech, the punctuation & speed weren't working properly. Also, it seems that his voice became monotone. I've tried with an already-finished project to see if…

asked 2024-03-23T14:18:35.1833333+00:00
etienne Brassard 25 Reputation points
commented 2024-05-16T14:55:07.53+00:00
Konstantinos Passadis 17,376 Reputation points MVP
0 answers

No module named 'azure' when using azure.cognitiveservices.speech

Hello, I have a problem with importing azure.cognitiveservices.speech. I installed the package with pip, but when importing it I get this error: ModuleNotFoundError: No module named 'azure'
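
A common cause of this error is that pip installed the package into a different Python interpreter than the one running the script. The sketch below is one way to check, under that assumption: it prints the interpreter in use and whether the module is importable from it. The helper name `is_importable` is made up for illustration.

```python
import importlib.util
import sys


def is_importable(name: str) -> bool:
    """Return True if `name` can be found by the current interpreter."""
    try:
        return importlib.util.find_spec(name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (such as "azure") is itself missing.
        return False


if __name__ == "__main__":
    print("interpreter:", sys.executable)
    print("speech SDK found:", is_importable("azure.cognitiveservices.speech"))
    # If this prints False, install with the interpreter shown above:
    #   <interpreter path> -m pip install azure-cognitiveservices-speech
```

Running pip as `python -m pip` with the same interpreter that runs the script avoids the mismatch.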

asked 2024-05-16T09:53:03.8766667+00:00
Mosub Gamal Ali Soliman Lawash 0 Reputation points
commented 2024-05-16T10:56:55.5333333+00:00
AshokPeddakotla-MSFT 28,221 Reputation points