question

tes432-0204 avatar image
2 Votes"
tes432-0204 asked 94493648 answered

Getting audio measurements from Speech Service Transcription

Hello,

Is it possible to take measurements such as volume (db), silence (%), talk over (%) and other assorted data that is necessary for call center operations? I'm referencing this documentation but I can't seem to find any appropriate APIs.

My company is working to utilize Azure services for call center operations and it would be helpful to us if the above mentioned data is available. I'm able to provide more detailed requirements.

Thanks in advance.


azure-cognitive-servicesazure-speech
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

ramr-msft avatar image
1 Vote"
ramr-msft answered ramr-msft commented

@tes432-0204 Thanks for the question.you can use the timestamps of the spoken segments to determine the silence within audio files. We don’t have a dedicated API for silence detection. Please follow the link for more information.


Release notes for Speech SDK API: https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/releasenotes
For other measurements forwarded to the product team to check. You can also share your feedback at the uservoice speech service forum. Can you please share more details about the use case.



· 2
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Hi, I've moved this question to Microsoft Partner's support network. Thanks for the help.

0 Votes 0 ·

Thanks @tes432-0204 .

0 Votes 0 ·
94493648 avatar image
0 Votes"
94493648 answered

II prefer transcribe audio to text, using speech to text converter. It has many additional features and gives you high-quality text products.


5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.