Azure multiple files recognition and confidence

margotmg 1 Reputation point
2021-06-02T14:10:47.49+00:00

hi!
So I'd like to transcribe multiple audio files using a customized model from Azure and continuous speech recognition. I don't want to use the batch transcription, I want to transcribe multiple files from my computer simultaneously. Is this possible using the python script shown here? https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/get-started-speech-to-text?tabs=windowsinstall&pivots=programming-language-python

Every time I try to test more than one audio file simultaneously I get the following error:

RuntimeError: Exception with an error code: 0x8 (SPXERR_FILE_OPEN_FAILED)
[CALL STACK BEGIN]

Also, I'm trying to get the confidence per word. Is this possible in python? I've seen this command in other questions but doesn't seem to work.

config.output_format = speechsdk.OutputFormat(1)

Maybe it's doing something but it doesn't return anything?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,069 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. GiftA-MSFT 11,176 Reputation points
    2021-06-03T00:39:10.31+00:00

    Hi, thanks for reaching out. One approach could be to loop through files in a directory and then transcribe. Regarding getting confidence per word, I only found examples for C#, I'm making some inquiries and will share updates for python accordingly.

    1 person found this answer helpful.

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.