In my scenario, when does the cost occur in Cognitive Speech Services?

Question

DevKya 0

Hello everyone,

I am using the Speech to Text feature of Azure Cognitive Services.

I have written a WebSocket endpoint using the Python library azure-cognitiveservices-speech and Django Channels to transmit real-time audio data, and I would like to know when the cost is incurred. Here is my scenario:

1. When the WebSocket connects, I connect to the Speech service with:

speech_recognizer.start_continuous_recognition_async()

2. Audio byte data is transmitted continuously (even when no voice is present), and I use the callback functions

speech_recognizer.recognizing.connect(self.callback_recognizing)

speech_recognizer.recognized.connect(self.callback_recognized)

to receive the transcribed text when the audio is meaningful.

3. When the WebSocket connection is closed, I disconnect from the Speech service with:

self.speech_recognizer.stop_continuous_recognition_async()

When is the cost incurred in this scenario?

Is it charged for the entire time I am connected to the service, or only for the duration of the audio that produces meaningful text?

I couldn't find the answer to this.

I would appreciate your help.

jormin 5 Reputation points

2023-08-08T11:25:25.7033333+00:00

In the context of Microsoft's Cognitive Speech Services, costs typically occur based on the usage of the services provided. Cognitive Speech Services encompass various features related to speech recognition, conversion, and understanding. The costs associated with these services are generally dependent on factors such as the number of requests, the duration of audio processed, and the specific features utilized.

For example, if you're using speech-to-text conversion services, you might incur costs based on the number of audio minutes processed. Similarly, for features like speaker recognition or language understanding, costs could be tied to the number of transactions or the amount of processing involved.

It's important to refer to Microsoft's pricing documentation or the specific terms associated with the Cognitive Speech Services you're using to get accurate and up-to-date information on when and how costs are incurred in your particular scenario.

Answer 1

@DevKya Billing for speech to text is based on audio hours, and it starts as soon as the service receives audio after you call start_continuous_recognition_async() or the corresponding REST API. Voice or meaningful audio might start later in the stream or audio file, but from the service's perspective the full duration of the audio, from start to end, is used to calculate billing. In your case, that is all the audio passed between start_continuous_recognition_async() and stop_continuous_recognition_async(), including any silence. I hope this helps!
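
To get a feel for what that means for a live stream, here is a minimal sketch that tallies how much audio you have forwarded, which is roughly what the billed duration should track. The AudioMeter class and its method names are hypothetical (they are not part of the Speech SDK), and the defaults assume raw 16 kHz, 16-bit, mono PCM; adjust them if your stream uses another format. The rate passed to estimated_cost is a placeholder, so take the real figure from the pricing page.

```python
from dataclasses import dataclass


@dataclass
class AudioMeter:
    """Hypothetical helper: tracks how much raw PCM audio was sent for recognition."""

    sample_rate: int = 16_000   # samples per second (assumed format)
    bytes_per_sample: int = 2   # 16-bit PCM (assumed format)
    channels: int = 1           # mono (assumed format)
    total_bytes: int = 0

    def add_chunk(self, chunk: bytes) -> None:
        # Silent chunks count as well: every byte sent is audio to the service.
        self.total_bytes += len(chunk)

    @property
    def seconds(self) -> float:
        # Duration of audio sent so far, derived from the PCM byte rate.
        bytes_per_second = self.sample_rate * self.bytes_per_sample * self.channels
        return self.total_bytes / bytes_per_second

    def estimated_cost(self, rate_per_hour: float) -> float:
        # rate_per_hour is a placeholder, not an actual Azure price.
        return self.seconds / 3600 * rate_per_hour
```

You would call meter.add_chunk(data) at the same place where your WebSocket handler forwards each audio chunk to the recognizer. If the estimate is higher than you would like, the usual lever is to send less audio, for example by not forwarding chunks while the client is muted.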

If this answers your question, please click Accept Answer and select Yes for "Was this answer helpful?". If you have any further questions, do let us know.
