How to remove the generated 'Sous-titres réalisés para la communauté d'Amara.org'

Amper, Charwin (Contractor) 85 Reputation points
2024-01-15T05:37:33.3066667+00:00

Hi, when using the Speech Whisper model via Batch Transcription, I noticed that an extra line of text is added to the JSON output.

[User's image]

The line of text shown in red is never spoken or heard in the source audio. The image above uses the French locale, but the same thing may happen with other locales as well. What could be causing this line to appear, and how can I remove it?

Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,078 questions
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,115 questions

Accepted answer
  1. navba-MSFT 27,550 Reputation points Microsoft Employee Moderator
    2024-01-16T05:59:34.01+00:00

    @Amper, Charwin (Contractor) Thanks for your reply. Using the fr-FR locale, I was able to reproduce this issue on my end. I reached out to the Speech Service Product Group and received the following update:

    This is a hallucination in the Whisper model itself, caused by its training data. "Sous-titres réalisés para la communauté d'Amara.org" translates to English as "Subtitles created by the Amara.org community", which is a common pattern in Whisper's training data source. So this is expected behavior.

    Currently, we are not making changes to Whisper's behavior, and there is no workaround for the Whisper model in Batch. You could try the default speech recognition (SR) model in the batch service instead and see whether it meets your quality requirements. Hope this answers your question.
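
If switching models is not an option, one possible client-side approach is to filter the known hallucinated line out of the JSON after the transcription completes. The following is only a minimal sketch and not a feature of the service: the helper names and the blocklist are hypothetical, the second spelling (with "par") is an assumed variant, and the code assumes the result file has a `recognizedPhrases` array whose entries carry an `nBest` list with a `display` field.

```python
import unicodedata

# Assumption: a hand-maintained blocklist of credit lines that Whisper
# is known to hallucinate. Extend it with strings you actually observe.
KNOWN_HALLUCINATIONS = [
    "Sous-titres réalisés para la communauté d'Amara.org",
    "Sous-titres réalisés par la communauté d'Amara.org",  # assumed variant
]


def _normalize(text):
    """Lower-case, unify apostrophes, collapse whitespace, trim punctuation."""
    text = unicodedata.normalize("NFKC", text).casefold()
    text = text.replace("\u2019", "'")
    return " ".join(text.split()).strip(" .!?")


_BLOCKLIST = {_normalize(s) for s in KNOWN_HALLUCINATIONS}


def is_hallucinated(text):
    """Return True if the text matches a blocklisted line exactly."""
    return _normalize(text) in _BLOCKLIST


def drop_hallucinated_phrases(result):
    """Return a copy of the result without blocklisted recognized phrases.

    Only the top nBest candidate of each phrase is inspected.
    """
    kept = []
    for phrase in result.get("recognizedPhrases", []):
        n_best = phrase.get("nBest") or [{}]
        if not is_hallucinated(n_best[0].get("display", "")):
            kept.append(phrase)
    cleaned = dict(result)
    cleaned["recognizedPhrases"] = kept
    return cleaned
```

Matching is exact after normalization, so genuine speech that merely mentions Amara.org is kept. The sketch does not rebuild any combined, file-level text, so if you consume such a field you would need to regenerate it from the phrases that remain.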

    1 person found this answer helpful.