1,469 questions with Azure AI Speech tags

Sort by: Updated
1 answer

tts cant read "Cоmmuniсаtiоn is often dоnе without our own соnѕсiоuѕ awareness"

Hello everyone, im using the azure tts and encountered a weird problem. i tried testing the following line "Cоmmuniсаtiоn is often dоnе without our own соnѕсiоuѕ awareness" on the various voices available in the voice catalog. so far all i…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-12T09:12:38.98+00:00
executer 0 Reputation points
commented 2024-06-13T11:31:33.63+00:00
santoshkc 5,730 Reputation points Microsoft Vendor
0 answers

Use Azure Speech through a fixed public IP

A customer wishes to utilize the Azure Speech service via the internet while reducing the number of IP addresses that must be unblocked by their firewall. I tried to do that by adding a virtual network to my speech resource, creating a public IP, and…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-11T18:38:08.1833333+00:00
Julien S 1 Reputation point
commented 2024-06-12T14:55:35.1333333+00:00
romungi-MSFT 43,341 Reputation points Microsoft Employee
1 answer

En- NG availability on Embedded speech

HI i would like to request availability of English - Nigeria variant for embedded TTS

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
Azure
Azure
A cloud computing platform and infrastructure for building, deploying and managing applications and services through a worldwide network of Microsoft-managed datacenters.
1,024 questions
asked 2024-06-11T11:29:50.2833333+00:00
Ankit Agarwal 0 Reputation points
commented 2024-06-12T11:19:33.91+00:00
santoshkc 5,730 Reputation points Microsoft Vendor
0 answers

Usage cost calculation using Azure Retail Prices API for Azure Speech to Text and Blob Storage

We are using Azure subscription with the Standard Tier. We have a requirement to calculate the monthly usage cost in JPY (Japanese Yen) of the Azure Speech to Text service and Azure Blob Storage in our application. we analyzed the Azure Retail Price API…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
2,814 questions
Azure Cost Management
Azure Cost Management
A Microsoft offering that enables tracking of cloud usage and expenditures for Azure and other cloud providers.
2,202 questions
Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
2,539 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,508 questions
asked 2024-06-05T04:22:36.4733333+00:00
Test Admin 171 Reputation points
commented 2024-06-12T11:03:39.01+00:00
dupammi 7,745 Reputation points Microsoft Vendor
0 answers

The Cognitive Services Speech SDK has no sound on iPhone's Safari, but can play successfully on Mac's Safari. How should this be handled?

The Cognitive Services Speech SDK has no sound on iPhone's Safari, but can play successfully on Mac's Safari. How should this be handled? (in react) const initializeSynthesizer = () => { const speechConfig =…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-11T11:25:35.4866667+00:00
jessebo 0 Reputation points
edited a comment 2024-06-12T05:53:09.2733333+00:00
navba-MSFT 18,575 Reputation points Microsoft Employee
0 answers

In, e.g., 0001.sentence.json, quotation marks present in the original sentence are dropped, if that quotation mark occurs at the beginning or end of the detected sentence. Is this expected behavior?

This is mostly in the title. Initially, I suspected this was a bug in the JSON serialization since JSON also uses " to delimit its fields, and these also have to be escaped in SSML. Upon further investigation, however, i found it also affects…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-01T05:49:55.17+00:00
Verbari LLC 20 Reputation points
commented 2024-06-11T14:19:32.7733333+00:00
romungi-MSFT 43,341 Reputation points Microsoft Employee
0 answers

Custom neural voice data size is at 0 after training. Should I deploy the model?

Hello, We prepared 1749 utterances in order to create a Custom Neural Voice. In Step 3, which is "Train Model", it identified these 1749 utterances and suggested 25 hours of training time (see image attached). The training has finished in over…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-06T08:01:43.4133333+00:00
PAVAGEAU Perrine 80 Reputation points
edited a comment 2024-06-11T10:58:45.84+00:00
PAVAGEAU Perrine 80 Reputation points
0 answers

Can some voices on spx text to speech not read phonetic alphabet ?

Hello ! I am using the Azure text to speech service with SSML to read phonetic alphabet, it works well except for when I pick the voice "Andrew multilingual". The spx command does not generate any voice but there is no error in the output. Are…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,508 questions
asked 2024-06-10T20:36:03.0866667+00:00
Houda 0 Reputation points
edited a comment 2024-06-11T07:51:21.9266667+00:00
Houda 0 Reputation points
0 answers

How to receive a real-time audio stream using Websocket in Spring boot with SDK

Hello. This is really driving me crazy. Send Audio Stream from the Web Client to the Server The server must convert Stream to Text using the SDK. However, the stream in wav format does not appear to be being sent from the client to the server. I…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-09T09:41:21.25+00:00
김동윤 0 Reputation points
commented 2024-06-10T21:17:01.96+00:00
VasaviLankipalle-MSFT 15,241 Reputation points
1 answer One of the answers was accepted by the question author.

How to use an Microsoft Entra ID to authenticate with the Speech to text REST API (for batch transcription

I looks like you can only authenticate to the "Speech to text REST API" with a api key (Ocp-Apim-Subscription-Key). What we would like is to authenticate with a Microsoft Entra ID. Why? Our application is running a AKS and all our containers…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-05-10T13:45:07.2666667+00:00
Johan Klijn 61 Reputation points
accepted 2024-06-10T18:35:02.3266667+00:00
Johan Klijn 61 Reputation points
0 answers

is there any way of accessing the sounds that are sent to speech sdk server

hi, I'm trying to make some ai assistant using speech SDK, device is Linux kernel based, and I've configured Alsa loopback and Pulseaudio to utilize the echo cancellation feature which should be supported by SDK. One thing that I noticed is that…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-06T06:16:28.6233333+00:00
Faris Lemes 40 Reputation points
commented 2024-06-07T10:28:42.6533333+00:00
dupammi 7,745 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Understanding "standard paid (S0)" pricing for "Audio Content Creation"

If I created a "speech service" with "standard paid (S0)", and I am only and only going to use "Audio Content Creation". What are going to be the pricing for it ? Will the free quota going to be included ? (500k characters)…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-06T18:17:14.8033333+00:00
Abdelrahman Mokhtar 40 Reputation points
accepted 2024-06-07T00:26:59.0333333+00:00
Abdelrahman Mokhtar 40 Reputation points
1 answer One of the answers was accepted by the question author.

I cant access anything in "Audio Content Creation", error "You don't have operation permissions"

I just created a speech service, but when I go to "Audio Content Creation", I can't do anything (New - Upload - Export) I tried to add myself as owner role, and other roles, but still, I can't do anything in Audio Content Creation.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-06T02:54:44.3266667+00:00
Abdelrahman Mokhtar 40 Reputation points
accepted 2024-06-06T18:19:23.3033333+00:00
Abdelrahman Mokhtar 40 Reputation points
1 answer One of the answers was accepted by the question author.

Will Azure AI Speech generate styles such as "happy", "cheerful", "excited" automatically from the data given?

I've added data with about 750 utterances. 80% are normal sentences, while 10% are questions and the other 10% are exclamations. What will Speech Studio need to generate styles such as Happy, Cheerful, etc? Do I have to give it more data? Or will…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-05-31T14:39:03.0333333+00:00
PAVAGEAU Perrine 80 Reputation points
accepted 2024-06-06T12:02:28.4+00:00
PAVAGEAU Perrine 80 Reputation points
1 answer One of the answers was accepted by the question author.

Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS

Subject: Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS Description: The Azure Neural TTS system is mispronouncing the Welsh contraction "i’w." Instead of producing the correct pronunciation…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-05-16T14:22:18.8166667+00:00
Verbari LLC 20 Reputation points
accepted 2024-06-06T06:00:48.4133333+00:00
Verbari LLC 20 Reputation points
0 answers

Speech Studio "Text to Speech" not respecting <break> markup

The text to speech renderer fails to apply the "break" markup in the Audio Content Creation interface of the Speech Studio. I haven't tried other markup. Yesterday, it didn't work with RyanMultinationalNeural, but worked with AndrewNeural. Now…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-05T17:33:51.8066667+00:00
Roy Jensen 20 Reputation points
edited a comment 2024-06-06T03:35:59.72+00:00
Roy Jensen 20 Reputation points
1 answer

Why am I getting a quota error?

I'm using Azure TTS and getting the following quota error: "You have reached the quota with your free-tier (F0) Speech resource. To continue to create audios with neural voices, switch to a standard paid resource, or upgrade your free-tier…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-03-05T20:15:33.4966667+00:00
Rich Hawksworth 0 Reputation points
answered 2024-06-05T16:11:12.9833333+00:00
ck ong 0 Reputation points
1 answer

No module named 'azure' when using azure.cognitiveservices.speech

Hello, I have a problem with importing azure.cognitiveservices.speech. I pip install the package but when importing it I got this error. ModuleNotFoundError: No module named 'azure'

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-05-16T09:53:03.8766667+00:00
Mosub Gamal Ali Soliman Lawash 0 Reputation points
commented 2024-06-05T16:04:24.1733333+00:00
AshokPeddakotla-MSFT 29,651 Reputation points
0 answers

How to transcribe silences to train a custom STT model?

Hey! 🙂 I'm about to fine-tune a STT model with Audio + human-labeled transcript data. I've gone through the docs and I'm pretty confident that I've the right use case for this type of custom model training. Also, I already know how to organize the data…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-03T11:55:33.11+00:00
Bruno Goncalves Vaz (P) 20 Reputation points
edited a comment 2024-06-04T11:01:15.9266667+00:00
Bruno Goncalves Vaz (P) 20 Reputation points
0 answers

Speech-to-Text batch transcribe API in germanycentralwest doesn't work

Last Friday (May 31 2024) we started getting the following errors on all transcripts sent to the batch transcription API on our speech resource in…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,469 questions
asked 2024-06-02T20:47:26.57+00:00
Matej the Mete 20 Reputation points
commented 2024-06-04T08:16:39.1566667+00:00
santoshkc 5,730 Reputation points Microsoft Vendor