Hi @Ritvij Sharma ,
Thank you for using the Microsoft Q&A forum.
To address your query, you can use the "format" query parameter to specify the output format of the API response. By default, the API response is in "Simple" format, which includes only the recognized text. However, you can use the "Detailed" format to obtain N-Best results in JSON format.
By specifying the "Detailed" format query parameter, the API response will include all the available text formats, such as "Display", "ITN", "Lexical", and "MaskedITN". The "ITN" format will display numbers as numerical values instead of text. You can retrieve this response from the API and integrate it into your application.
You can find a comprehensive list of all the supported query parameters for the Speech-to-text on this page
Thank you!
I hope this helps.
If this answers your query, do click Accept Answer and Yes for was this answer helpful.