We would like to know noises effect to speech to text performance

Kohei Watanabe 41 Reputation points
2021-10-19T00:24:10.79+00:00

Hello. We are using Azure Japanese speech to text.

We want to evaluate its performance.

What parameters affect the result, noises or microphones or intonations or etc....?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,383 questions
0 comments No comments
{count} votes

Accepted answer
  1. romungi-MSFT 41,861 Reputation points Microsoft Employee
    2021-10-19T09:40:25.207+00:00

    @Kohei Watanabe Azure speech to text provides two options with respect to the models that are used behind the service.

    1. Baseline Model
    2. Custom Model

    With the baseline model you can use the API directly without any customization where the model is trained by Microsoft against fairly decent background conditions. If your scenario involves recognition of speech in a day to day scenario or recordings this model should work for you right away with the API.

    The custom model can be used in a scenario where the baseline model accuracy is not to your standards. For example, you have custom words in your speech like acronyms, phrases used in an organization, speech from factory floor with lot of background noises etc. This custom model is trained with your audio files on top of the baseline model so all the capabilities of baseline model are built in your resultant endpoint.

    To summarize, the result from the service depends on your scenario and all the factors do effect them but if your scenario isn't for a custom background then you can use the baseline model rightaway and evaluate the performance. Please check the FAQ document that could help you with more details.

    1 person found this answer helpful.

2 additional answers

Sort by: Most helpful
  1. Kohei Watanabe 41 Reputation points
    2021-10-25T08:29:48.92+00:00

    mmm, the model could be trained by both audio and text but just text seems to be used...

    This UI is really confusing....

    143308-%E3%82%B9%E3%82%AF%E3%83%AA%E3%83%BC%E3%83%B3%E3%82%B7%E3%83%A7%E3%83%83%E3%83%88-2021-10-25-170910.png

    0 comments No comments