SpeechSDK - for Compressed audio

Test Admin 176 Reputation points
2023-07-10T05:40:25.4366667+00:00

We are using SpeechSDK for audio file transcription.

https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/how-to-use-codec-compressed-audio-input-streams

As per the above link, for compressed audio we can Gstreamer for compressed audio's(MP4,MP3 etc )transcription.

When we tried with less the 5 MB Mp4 file, transcription text is received.

But when we tried with 20 MB or 30 MB more MP4 file, transcription text is not received(In console below message is displayed "Session stopped event.")

Code which we are using for compressed audio,

    	PullAudioInputStream pullAudio = AudioInputStream.createPullStream(new BinaryAudioStreamReader("//D://sample.mp4"),
    		    AudioStreamFormat.getCompressedFormat(AudioStreamContainerFormat.ANY));
    		AudioConfig audioConfigs = AudioConfig.fromStreamInput(pullAudio);
    		audioConfigs.setProperty(PropertyId.Speech_LogFilename, "D:\\Log.txt");

We checked the log file,

[563724]: 1790ms SPX_TRACE_ERROR: base_gstreamer.cpp:196 Error from GStreamer: Source: qtdemux0

Message: GStreamer encountered a general stream error.

DebugInfo: ../gst/isomp4/qtdemux.c(7517): gst_qtdemux_process_adapter (): /GstPipeline:pipeline/GstDecodeBin:decodebin/GstQTDemux:qtdemux0:

no 'moov' atom within the first 10 MB.

Could you verify this and help us to overcome this error.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,061 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.