Hi , I am using custom trained model for speech to text for spanish language , how can I get the transcription without diacritics. I tried using custom display formatting using rewrite . But it seems it does not work. Any help is really appreciated

Shweta Ruparel 0 Reputation points
2023-12-23T22:23:23.41+00:00

These are the examples I tried in custom display formatting using #rewrite

- sí	si
- qué	que
- gustaría	gustaria
- ¿Puedo	puedo
- balsámico	balsamico
Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,876 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Ramr-msft 17,741 Reputation points
    2023-12-25T03:12:59.07+00:00

    Thanks for the question, To remove diacritics from the transcription in a custom trained model for speech to text, you can post-process the output of your model. Here’s a general approach: Transcribe the Speech: Use your custom trained model to transcribe the speech to text.

    Remove Diacritics: After you get the transcription, you can use a Python library like unidecode to remove diacritics from the Spanish text.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.