Incomplete transcript when the recording has a long pause before the conversation resumes

Question

Wanyee Lee 20 Reputation points

I am having trouble getting a complete transcription when my recordings follow a pattern like the one below:

There is a long silence in the middle of the conversation. For example, there is conversation during minutes 1-2, then a long pause from minute 2 to minute 10, then conversation again during minutes 11-13, and so on.

I am using Azure batch transcription for Thai, and the generated transcript contains only the conversation from minutes 1-2. Nothing is transcribed for the rest of the recording.

What should I do to fix it? Please advise, thanks in advance!

Manas Mohanty 5,700 Reputation points Microsoft External Staff Moderator

2025-06-18T00:18:09.3166667+00:00

Hi Wanyee Lee

Please let us know if the pointers from Danny Dang on preprocessing the audio files helped address your issue.

Thank you.

Manas Mohanty 5,700 Reputation points Microsoft External Staff Moderator

2025-06-18T17:12:43.2666667+00:00

Hi Wanyee Lee

We have not heard back from you. We hope the pointers shared were useful to you.

Thank you.

Wanyee Lee 20 Reputation points

2025-06-19T01:47:48.36+00:00

Thanks for the suggestion, but it does not help in my case because I also need the timestamps of the silent parts. If I cut the video and remove the silent portions, I have to inject them back afterwards to recover the timestamps. Will this limitation in Azure batch transcription be fixed soon?

Manas Mohanty 5,700 Reputation points Microsoft External Staff Moderator

2025-06-19T23:24:27.88+00:00

Hi Wanyee Lee

I am not sure of the ETA for fixing this limitation on the Batch transcription side. Alternatively, you can switch to Whisper models.

Feel free to post this as feedback on the Feedback forum.

Thank you.

Accepted answer

Answer 1

Hi Wanyee,

Thank you for contacting Microsoft Q&A Forum.

The issue you're experiencing with incomplete transcriptions due to long pauses in your recordings is a known limitation. Currently, Azure Batch Transcription does not have any parameter to configure silence time.

Steps to Fix:

Preprocess Your Audio:

Consider preprocessing your audio files to remove or reduce long pauses before submitting them for transcription. This can help ensure that the entire conversation is captured in the transcript.

Here is a sample ffmpeg command:

ffmpeg -i input_audio.wav -af silenceremove=start_periods=1:start_duration=1:start_threshold=-30dB:stop_periods=-1:stop_duration=1:stop_threshold=-30dB output_trimmed.wav

start_periods=1: Trims leading silence until the first non-silent period is found.
start_duration=1: Non-silence must last at least 1 second before trimming of the leading silence stops.
start_threshold=-30dB: Anything quieter than -30dB is considered silence.
stop_periods=-1: The negative value makes the filter also remove silence in the middle of the file, instead of stopping at the first silent period after speech.
stop_duration=1, stop_threshold=-30dB: A silence must last at least 1 second, and be quieter than -30dB, to be removed.
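
If the original timestamps still matter after trimming, one possible approach is to record where the silences are and map transcript times back to the original timeline. The Python below is only a rough sketch of that idea, not an Azure feature. It assumes the silent ranges come from the log of ffmpeg's silencedetect filter (for example `ffmpeg -i input_audio.wav -af silencedetect=noise=-30dB:d=1 -f null -`), and that the speech-only audio is built by cutting exactly at those boundaries and concatenating the pieces. The function names `parse_silences`, `speech_segments` and `to_original_time` are made up for illustration. The silenceremove filter may not cut at exactly the same points, so treat the mapped times as approximate unless you cut the segments yourself.

```python
import re

# Lines that ffmpeg's silencedetect filter writes to stderr look like:
#   [silencedetect @ 0x55d] silence_start: 120.5
#   [silencedetect @ 0x55d] silence_end: 600.25 | silence_duration: 479.75
_START = re.compile(r"silence_start:\s*(-?\d+(?:\.\d+)?)")
_END = re.compile(r"silence_end:\s*(-?\d+(?:\.\d+)?)")


def parse_silences(log_text):
    """Return [(start, end), ...] in seconds from silencedetect log text."""
    silences, start = [], None
    for line in log_text.splitlines():
        m = _START.search(line)
        if m:
            # ffmpeg can report a slightly negative start at the very beginning.
            start = max(0.0, float(m.group(1)))
            continue
        m = _END.search(line)
        if m and start is not None:
            silences.append((start, float(m.group(1))))
            start = None
    return silences


def speech_segments(silences, total_duration):
    """Return the non-silent (start, end) ranges of the original recording."""
    segments, cursor = [], 0.0
    for s_start, s_end in silences:
        if s_start > cursor:
            segments.append((cursor, s_start))
        cursor = max(cursor, s_end)
    if cursor < total_duration:
        segments.append((cursor, total_duration))
    return segments


def to_original_time(t_trimmed, segments):
    """Map a time in the concatenated speech-only audio to the original audio."""
    elapsed = 0.0  # speech-only seconds covered by the segments seen so far
    for seg_start, seg_end in segments:
        length = seg_end - seg_start
        if t_trimmed <= elapsed + length:
            return seg_start + (t_trimmed - elapsed)
        elapsed += length
    raise ValueError("timestamp is past the end of the trimmed audio")
```

For the recording described in the question (speech until 2:00, silence until 10:00, speech until 13:00), the segments would be 0-120 s and 600-780 s, so a word at 150 s in the speech-only audio maps back to 630 s in the original. An alternative with the same bookkeeping is to transcribe each speech segment as its own file and add the segment's start time to every timestamp in its transcript.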

Reference:

https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-transcription-create?pivots=rest-api#request-configuration-options

If I have answered your question, please accept this answer as a token of appreciation and don't forget to give a thumbs up for "Was it helpful"!

Best Regards,

Danny Dang 85 Reputation points Independent Advisor

2025-06-20T08:48:02.3366667+00:00

Hi Wanyee,

I hope everything is going well on your end.

Following up on my answer regarding the inquiry, please let me know if there are any additional questions or concerns that require further assistance.

Have a good day ahead!
