1,555 questions with Azure AI Speech tags

Sort by: Updated
0 answers

How to have the control over the audio playing when text is converted to speech using Azure Speech Service?

Below is the code I am using to convert text to audio for a button click using Azure speech service, but I am unable to stop the audio that is playing, I would like to use the same button to stop the audio while it is playing. How to have the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
368 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,636 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,642 questions
asked 2024-07-08T06:17:54.4566667+00:00
Shivani V 0 Reputation points
commented 2024-07-11T04:36:52.03+00:00
dupammi 8,035 Reputation points Microsoft Vendor
0 answers

How to read English words aloud in syllables by text-to-speech? The purpose is to make videos of memorizing English words.

It feels like these sounds are meant to optimize the reading of complete sentences, but they can't read words in detail by syllables.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-09T05:27:41.88+00:00
sxmud 0 Reputation points
commented 2024-07-11T04:32:05.4166667+00:00
dupammi 8,035 Reputation points Microsoft Vendor
1 answer

Can't preview a sound on Speech Studio

It happens on East US, S0

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-09T16:25:54.4633333+00:00
Quill Zhou 25 Reputation points
answered 2024-07-10T20:01:28.8733333+00:00
VasaviLankipalle-MSFT 15,946 Reputation points
1 answer One of the answers was accepted by the question author.

Seeking Optimal Speech Transcription Service for Mixed Chinese and English Scenarios

Our speech recognition scenario mainly involves a mix of Chinese and English. Currently, we have chosen the Chinese language recognition type (as there is no specific type for mixed Chinese and English). Besides manually adding hotwords and conducting…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-10T10:19:21.5833333+00:00
hexarrior 40 Reputation points
accepted 2024-07-10T19:48:13.1166667+00:00
hexarrior 40 Reputation points
1 answer One of the answers was accepted by the question author.

Improving Speech to Text Accuracy for Industry-Specific Terminology with Azure AI Service

Hi all, I want to improve the accuracy of reading industry-specific terminology(in Japanese) using Azure AI service's Speech to Text. The challenge is that these terms can have different meanings in general contexts versus industry-specific contexts. How…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,642 questions
asked 2024-07-09T07:38:58.95+00:00
KT 150 Reputation points
commented 2024-07-09T09:30:50.9633333+00:00
KT 150 Reputation points
1 answer

no voice when I click "play" button to create speech from text

no voice when I click "play" button to create speech from text, my laptop voice turned on already.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,642 questions
asked 2024-07-05T07:22:27.8133333+00:00
Grace Xiong 0 Reputation points Microsoft Employee
commented 2024-07-09T09:01:55.4866667+00:00
santoshkc 6,955 Reputation points Microsoft Vendor
1 answer

How to fix an issue where my 3D Blendshapes do not align with the audio.

I'm trying to apply viseme 3D Blend Shapes to drive my 3d avatar.  When the result is returned, the audio plays before the response's FrameIndex and BlendShape. I received event.animation and used it to set the weight for each blend shape name.  However,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-08T07:21:23.1533333+00:00
Ananchai Mankhong 0 Reputation points
commented 2024-07-08T17:42:37.0233333+00:00
Ananchai Mankhong 0 Reputation points
1 answer

Can I use phonetic language to create perfect speech

Can I use International Phonetic alphabetic translation in azure text to speech to come out with a near perfect speech? If so, how?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2023-12-06T06:55:10.1666667+00:00
Geoff Surtees 0 Reputation points
edited a comment 2024-07-08T12:58:32.9966667+00:00
Stefano Michieletto 0 Reputation points
0 answers

How can I use Whisper on Azure AI Speech

Hi, I recently switched from using the whisper model via Azure OpenAI to using Azure AI Speech. However, I noticed that the quality of some transcriptions is worse on Azure AI Speech. On the below page it says that it is possible to use the whisper model…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-04T18:25:53.3633333+00:00
Julian 0 Reputation points
commented 2024-07-08T11:03:44.17+00:00
dupammi 8,035 Reputation points Microsoft Vendor
1 answer

How to get sentence word timestamp results for real-time speech recognition ?

I am using Golang's SDK this is my golang code func (m *microsoft) Do(ctx context.Context, path string) (string, error) { defer os.Remove(path) accessKeyConfig := AccessKeyList[rand.Intn(len(AccessKeyList))] subscription := accessKeyConfig.Key region…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-05T06:25:10.16+00:00
莓 草 0 Reputation points
commented 2024-07-08T09:01:24.4366667+00:00
navba-MSFT 20,635 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Microsoft: fix captioning by Speech Studio

The captioning functionality in the Speech Studio is an utter failure. This is typical output: I encourage Microsoft to implement the functionality that allows the user to specify the number of lines of text (typically one or two), and the maximum…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-02T00:44:05.4533333+00:00
Roy Jensen 40 Reputation points
commented 2024-07-08T05:10:17.46+00:00
navba-MSFT 20,635 Reputation points Microsoft Employee
1 answer

create a basic voice-interactive dashboard

Hello Team, I need to create a basic voice-interactive dashboard using Azure Cognitive services like, Speech service, CLU(Conversational Language Understanding) & PowerBI.Also suggest if any other way to achieve this. It would be really helpful.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure AI Language
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
390 questions
asked 2024-06-27T17:16:48.97+00:00
Vijayakumar Elumalai 105 Reputation points
commented 2024-07-07T08:09:54.2933333+00:00
YutongTie-MSFT 48,581 Reputation points
1 answer

Request to Increase Whisper Model Quota Limit

Hi Azure Community, I hope everyone is doing well. I am currently working on a project that requires a higher capacity of the Whisper model than my current Azure quota allows. I am seeking guidance on how to increase my Whisper model quota…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,642 questions
asked 2024-07-03T19:56:23.8133333+00:00
narayanam Srinivasulu 0 Reputation points
commented 2024-07-05T19:27:03.9066667+00:00
VasaviLankipalle-MSFT 15,946 Reputation points
1 answer

Azure AI Speech content filter

Hey everyone, I am using the Azure AI Speech api for real time transcription of conversations. The problem I am facing is that the content filter recognizes words such as the german 'dick' as offensive. This might be true in english, however in german…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-04T18:16:35.31+00:00
Julian 0 Reputation points
commented 2024-07-05T05:54:41.2666667+00:00
navba-MSFT 20,635 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Ingesting webpage URL for the open AI web app in Azure

Hi there. In the Azure open AI studio, there is an option for defining webpage URL when you add data for the app but based on the requirements in the Microsoft website, it can only extract text up tp 20 sublinks and also I can only put one URL in it. …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,636 questions
asked 2024-02-26T19:15:09.8766667+00:00
Jalali, Hadi 40 Reputation points
commented 2024-07-05T04:55:11.28+00:00
Mansi Gusain 0 Reputation points
0 answers

Failed to get HTTP platform singleton instance. Error: 27

Hello! I'm working with the Azure Speech Services SDK via python. The code worked well, until I started getting blank responses. Basically my request got cancelled, when checking the reason, I got this: #…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-04T14:37:35.1233333+00:00
Vitalii Brydinskyi 0 Reputation points
commented 2024-07-05T01:16:34.1666667+00:00
VasaviLankipalle-MSFT 15,946 Reputation points
0 answers

I use speech to text and want to transcribe the corresponding text, but it keeps timing out without successful recognition. Why is this happening?

this is my file,and download it https://feedback.meitudata.com/public/file/yASWSTNPh2RE3Ncv.wav

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-04T11:47:58.2133333+00:00
莓 草 0 Reputation points
commented 2024-07-04T19:39:11.02+00:00
VasaviLankipalle-MSFT 15,946 Reputation points
1 answer

Is each voice in the voice gallery based on a clone of one specific natural person or is it synthetic?

I would like to understand whether: Each voice in the voice gallery is based on a clone of one specific natural person? Voices are synthetic (similar to those from 11Labs Voice Design) that cannot be traced back to an individual person? Thank you!

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-02T07:18:37.07+00:00
mpsb 0 Reputation points
commented 2024-07-04T10:52:53.4666667+00:00
santoshkc 6,955 Reputation points Microsoft Vendor
1 answer

Azure speech speaker differentiation

Hi, I would like to use azure speech to transcribe a meeting, however i want it to differentiate between anonymous speakers, eg speaker A, speaker B. Is it possible to do that. Are there any samplesand tutorials out there that I can just take and use?…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,642 questions
asked 2024-07-04T08:05:43.6733333+00:00
jchoo 0 Reputation points
edited the question 2024-07-04T08:17:20.3633333+00:00
AmaranS 3,865 Reputation points Microsoft Vendor
0 answers

Is there a way to make speech service transcription faster (diarization with speakers differentiated)?

Currently the speed seems to be half the time for wav and 1:1 ratio for mp4 with gstreamer. From this post, it seems half the time for wav file is the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,555 questions
asked 2024-07-02T05:30:12.4666667+00:00
kk 0 Reputation points
commented 2024-07-04T06:45:28.3466667+00:00
santoshkc 6,955 Reputation points Microsoft Vendor