1,411 questions with Azure AI Speech tags

Sort by: Updated
0 answers

Queries Regarding Azure OpenAI Integration: DALL-E & GPT-Vision Availability, Document & Image Upload Issues, and Translation Services

Question 1: Availability of DALL-E and GPT-Vision in India When can we expect DALL-E and GPT-Vision to become available in India? Question 2: Document Upload Issue with Azure OpenAI We've encountered an issue uploading documents on Chat Playground and…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
Azure Computer Vision
Azure Computer Vision
An Azure artificial intelligence service that analyzes content in images and video.
316 questions
Azure Translator
Azure Translator
An Azure service to easily conduct machine translation with a simple REST API call.
345 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,407 questions
asked 2024-05-08T07:09:30.22+00:00
Niket Kumar Singh 190 Reputation points
0 answers

SSML: Using <lang xml:lang=""> within a multilingual voice sounds incorrect / unlike when used with the language-specific voice

I am developing a TTS application that pronounces "nonsense words" with specific language pronunciations. For example, I am using Polish language voices to pronounce non-Polish words. If I use a Polish-specific language, I hear what I expect…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-05T16:18:51.91+00:00
mkb13 11 Reputation points
commented 2024-05-08T07:02:55.93+00:00
dupammi 6,815 Reputation points Microsoft Vendor
0 answers

azure prononciation assessment async assessment

i'am using azure speech recognizer sdk , to do the prononciation assessment of an audio file. the problem when the speech is in french the results are always low , and no expressive const language = await detectSingleSpeechLanguage(text) …

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-07T17:04:34.3266667+00:00
Iheb Jandoubi 5 Reputation points
edited the question 2024-05-08T06:55:04.7166667+00:00
romungi-MSFT 42,286 Reputation points Microsoft Employee
1 answer

When will more avatar's be available?

The Text to Speech Avatar has been in preview for about six months. Any idea when a full release will be done? And what will be in that release? additional avatars adjustable clothing, hair, skin tone, ... ??? Thx

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-08T04:15:01.0133333+00:00
Roy Jensen 0 Reputation points
answered 2024-05-08T06:34:49.25+00:00
dupammi 6,815 Reputation points Microsoft Vendor
0 answers

How to get spoken Language in audio file with Azure Speech sdk in C#?

Hi, I need to detect what's spoken language in an audio file. I have already read the documentation about language identification for speech service but in the SpeechRecognitionResult object result I don't have the recognized language code. Is there a…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,407 questions
asked 2024-05-07T10:06:08.93+00:00
Matteo Gianfermi 0 Reputation points
commented 2024-05-08T00:07:07.0766667+00:00
VasaviLankipalle-MSFT 14,576 Reputation points
1 answer One of the answers was accepted by the question author.

Retirement Announcement - Upgrade to Text-to-Speech Neural Voice on 31 August 2024

Text-to-Speech currently supports both standard and neural voices. However, since the neural voices provide more natural sounding speech output, and thus, a better end-user experience, we are retiring the standard voices on 31st August 2024 and they will…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2021-08-27T06:03:34.957+00:00
romungi-MSFT 42,286 Reputation points Microsoft Employee
commented 2024-05-07T13:36:12.67+00:00
Vinod Mankare 0 Reputation points
0 answers

Error while trying to train a 202240228 Whisper Large v2 baseline model

When trying to train a custom speech model using a dataset containing an audio file and its transcript, the model failed to train due to an internal error. Can anyone provide any insights on how to troubleshoot this issue?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-03T08:53:22.2033333+00:00
Engineering 0 Reputation points
commented 2024-05-07T05:30:28.07+00:00
Engineering 0 Reputation points
0 answers

Is there any way to dub audios maintaining its original intonation, breaks and speed?

I've a voice audio that has a lot of deeper and higher tones and some breaks and "word-emphasis" in specific moments, but, when using the "Speech Translation" functionality, this audio loses all of its life (all this complexity),…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-02T20:04:02.48+00:00
Lucas 0 Reputation points
commented 2024-05-07T04:57:48.79+00:00
santoshkc 4,435 Reputation points Microsoft Vendor
0 answers

关于Azure AI Speech “zh-CN-XiaochenNeural” 音色异常

Since early April, the tone of the "Xiaochen" model has been experiencing abnormalities. At that time, attempts were made in regions such as East Asia, Southeast Asia, and the East US, all of which showed abnormalities, except for the Japan…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-06T23:40:20.7666667+00:00
斌 周 0 Reputation points
0 answers

Not able to use Azure AI Speech Avatar on ReactJs

Hello, I am trying to implement Live chat avatar using ReactJS in my application. When implementing the sample code, I am getting the following console logs: is TURN server active? yes Avatar started. Speech and avatar synthesized to video…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-06T07:43:18.2433333+00:00
Jivi Health 0 Reputation points
commented 2024-05-06T22:02:39.7666667+00:00
VasaviLankipalle-MSFT 14,576 Reputation points
0 answers

Azure Text to Speech F0 (Free) Tier Limits

Hi, I have the F0 (Free) Tier. I send a request to TTS service and get the blendshape data and voice. When I make a request, the first 4 get a response. The 5th one does not return a response anymore. If i restart my server, I can make another 4 request…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-04T20:04:25.47+00:00
Rob Enriquez 0 Reputation points
commented 2024-05-06T21:41:52.48+00:00
Rob Enriquez 0 Reputation points
1 answer

Speech Recognition Live transcription not detecting any other language instead of English

Hi, I am using Speech Recognition resource in my application for live transcription. It's perfectly going with English language but when I am trying to say in Hindi then it's not detecting. I want to create my application for multiple languages used in…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
Azure AI Language
Azure AI Language
An Azure service that provides natural language capabilities including sentiment analysis, entity extraction, and automated question answering.
359 questions
asked 2024-04-27T06:16:27.79+00:00
Jagwant singh 0 Reputation points
commented 2024-05-06T11:31:21.9566667+00:00
Jagwant singh 0 Reputation points
0 answers

zh-CN-XiaochenNeural Abnormal timbre

zh-CN-XiaochenNeural, abnormal timbre. The same problem occurred in October last year. https://learn.microsoft.com/en-us/answers/questions/1431823/the-timbre-of-the-voice-of-zh-cn-xiaochenneural-ha —————————————————————— How long will it take to recover…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-04-11T11:39:56.2733333+00:00
斌 周 0 Reputation points
commented 2024-05-05T22:24:41.89+00:00
斌 周 0 Reputation points
1 answer

Azure Speech AI service Custom Commands Alternative

Hi, we are looking forward to using Azure AI services especially the speech service to build a bot that does certain tasks based on speech for example if we ask the bot to "Make a reservation for Instrument A from 9 AM to 10 AM" then the bot…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-05T06:48:57.4766667+00:00
Varun Surana 0 Reputation points
answered 2024-05-05T07:34:32.5+00:00
Gowtham CP 155 Reputation points
0 answers

Is it possible to stream Groq LLM responses as and when I get it into Azure TTS?

Hi! I'm trying to build a real time LLM conversation bot, and need it to be as low latency as possible. I have successfully set up TTS audio output streaming…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-03T08:10:45.06+00:00
arunnair 0 Reputation points
commented 2024-05-04T15:58:01.0066667+00:00
dupammi 6,815 Reputation points Microsoft Vendor
1 answer

How to have multiple mstts:audioduration in a single <speak>?

I'm trying to adjust the duration of individual phrases so that the synthesized voice matches with the voice in the original audio. It's working perfectly when done like this: <speak xmlns="http://www.w3.org/2001/10/synthesis"…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-02T21:27:23.34+00:00
Lucas 0 Reputation points
answered 2024-05-03T05:11:09.3433333+00:00
dupammi 6,815 Reputation points Microsoft Vendor
1 answer

Do Text to Speech containers TTS provide visemes and blendshapes like the API?

I'm currently using the Speech API and consuming the visemes and blendshapes that are returned. In an effort to reduce latency I would like to run the speech services locally via the text to speech container. Does the response of the container STT…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-04-22T05:35:00.3933333+00:00
Matt Ma 0 Reputation points
answered 2024-05-02T23:05:47.7666667+00:00
Matt Ma 0 Reputation points
0 answers

OpenSSL Issue When Running Azure Speech to text on docker

Hey folks, I'm trying to run speech-to-text using Python on a docker container, but I'm getting an SSL error, I have tried following the steps mentioned here for SSL setup and also installed the required dependencies as mentioned here. However, I'm still…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-01T10:44:52.22+00:00
Ayush Kumar 0 Reputation points
commented 2024-05-02T08:48:56.48+00:00
romungi-MSFT 42,286 Reputation points Microsoft Employee
0 answers

Exception while running Azure Speech to text SDK with jar file (UnsatisfiedLinkError , setTempDirectory)

Hi Team I'm getting errors while running my Java jar in Windows and centos7, However the same is running fine in my Eclipse IDE. The issue is coming when I build the jar and run it in the environment. The error details are below: 2024-05-01 15:50:10…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-01T12:21:38.4333333+00:00
Ayush Kumar 0 Reputation points
edited a comment 2024-05-01T21:24:43.2666667+00:00
VasaviLankipalle-MSFT 14,576 Reputation points
0 answers

Customizing a Conversation Model for a Hebrew Car Sales Call Center

Hi, I am looking for guidance on the process of customizing a model to transcribe conversations in a Hebrew car service call center. The conversations predominantly involve Hebrew-specific domain terms and non-dictionary words. Could you provide some…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,411 questions
asked 2024-05-01T11:09:25.73+00:00
Shahar Spencer 60 Reputation points
commented 2024-05-01T21:13:12.54+00:00
VasaviLankipalle-MSFT 14,576 Reputation points