How to transcribe an interview with two speakers from a single audio file, similar to Word 365, using the spx recognize CLI
Hello everyone,
I have a series of interviews recorded as MP3 files, and I would like to use the Azure Speech CLI to transcribe them in a format similar to the one produced by the built-in Word 365 transcription feature.
I would like to use the Azure Speech service, because the WER is much lower. I tried using:
spx recognize --file audio.mp3 --format mp3 --language en-US --output all text --output all file output.tsv
But the output doesn't include timestamps or speakers. I see a lot of options in the help, but I couldn't figure out which ones I would need to produce the information I require.
The output doesn't need to be plain text. I can post-process it to get to the format I need, but I would like to get something that contains TIME - SPEAKER_ID - UTTERANCE for each utterance.
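To make the target concrete, here is a minimal Python sketch of the post-processing step, not a definitive implementation. It assumes each utterance has already been parsed into a dict with the hypothetical keys `offset_ticks`, `speaker`, and `text`; those names are made up for illustration and are not the actual spx output fields. It also assumes the offset is expressed in 100-nanosecond ticks.

```python
# Assumption: offsets are in 100-nanosecond ticks (10,000,000 per second).
TICKS_PER_SECOND = 10_000_000


def format_timestamp(offset_ticks: int) -> str:
    """Convert a tick offset to HH:MM:SS, dropping fractional seconds."""
    total_seconds = offset_ticks // TICKS_PER_SECOND
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


def format_transcript(records: list[dict]) -> str:
    """Render records as 'TIME - SPEAKER_ID - UTTERANCE' lines, ordered by time.

    Each record uses the hypothetical keys 'offset_ticks', 'speaker', 'text'.
    """
    ordered = sorted(records, key=lambda r: r["offset_ticks"])
    return "\n".join(
        f'{format_timestamp(r["offset_ticks"])} - {r["speaker"]} - {r["text"]}'
        for r in ordered
    )


if __name__ == "__main__":
    # Made-up sample data, only to show the layout.
    sample = [
        {"offset_ticks": 75 * TICKS_PER_SECOND, "speaker": "Speaker 2", "text": "Thanks for having me."},
        {"offset_ticks": 0, "speaker": "Speaker 1", "text": "Welcome to the interview."},
    ]
    print(format_transcript(sample))
```

Any intermediate output that carries an offset, a speaker label, and the recognized text per utterance would be enough to feed a step like this.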
Thanks,
Nick