Transcription Denormalization.

Alex Cohen 10 Reputation points
2024-07-19T15:31:59.8+00:00

Is there a way to "denormalize" Azure Speech transcription so that it returns a verbatim transcript (as close as possible, including filler words, hesitations, repetitions, etc.)? I also need word-level timestamps and diarization.

I am hoping there is a simple option, like the "normalize = False" setting in some Whisper packages.

Thanks
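
For context, a minimal sketch of what the least-normalized output looks like, under stated assumptions. With the detailed output format, each recognition result carries a JSON payload whose NBest entries include a "Lexical" form (no inverse text normalization, capitalization, or punctuation) and, when word-level timestamps are requested, a "Words" list with offsets and durations in 100-nanosecond units. The sample payload and the helper name `lexical_words` below are hypothetical, and whether fillers and hesitations actually survive in the lexical form is not guaranteed.

```python
import json

# Offsets and durations in the detailed result are in 100-nanosecond units.
TICKS_PER_SECOND = 10_000_000

# Hypothetical payload, shaped like a detailed-format recognition result.
# The words and timings are made up for illustration.
SAMPLE_RESULT = json.dumps({
    "RecognitionStatus": "Success",
    "DisplayText": "I think we should go.",
    "NBest": [{
        "Confidence": 0.93,
        "Lexical": "um i i think we should go",
        "Display": "I think we should go.",
        "Words": [
            {"Word": "um", "Offset": 5000000, "Duration": 3000000},
            {"Word": "i", "Offset": 9000000, "Duration": 1000000},
            {"Word": "i", "Offset": 11000000, "Duration": 1000000},
        ],
    }],
})


def lexical_words(result_json):
    """Return the first NBest entry's lexical text and (word, start_s, end_s) tuples."""
    result = json.loads(result_json)
    best = result["NBest"][0]
    words = [
        (
            w["Word"],
            w["Offset"] / TICKS_PER_SECOND,
            (w["Offset"] + w["Duration"]) / TICKS_PER_SECOND,
        )
        for w in best.get("Words", [])
    ]
    return best["Lexical"], words


text, words = lexical_words(SAMPLE_RESULT)
print(text)
for word, start, end in words:
    print(f"{start:.2f}-{end:.2f}s  {word}")
```

In the Python Speech SDK, this payload should be reachable by setting `speech_config.output_format = speechsdk.OutputFormat.Detailed`, calling `speech_config.request_word_level_timestamps()`, and reading `result.properties.get(speechsdk.PropertyId.SpeechServiceResponse_JsonResult)`. Speaker labels would come from a separate path such as `speechsdk.transcription.ConversationTranscriber`; combining the two is an assumption that is not verified here.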

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.