question

JyotsnaRavi-6184 avatar image
0 Votes"
JyotsnaRavi-6184 asked AnuragSharma-08 edited

Android 10/11 with Azure Speech - Workaround to record audio (Voice call) & pass chunks or stream audio (Voice call) to Azure Speech service

Android 10/11 requires specific permission for recording audio. Require help in recording the voice call or streaming the voice call in real time to Azure speech SDK. This was successfully implemented with Android 8.

azure-speechfasttrack-azure-startup
· 3
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Hi, I'm a developer from the above mentioned project, our objective is to transcribe voice calls in android. The problem in accessing audio during a call is due to android's tighter restrictions introduced for ensuring user privacy on android 10 and above. When we attempt to capture audio, the incoming voice just registers silence or in some cases - extremely damped audio. We have used Azure Speech SDK using the AudioRecord API for converting speech to text from the incoming call audio and it works well on android 8/9 but not on android 10/11. We cannot use VOICE_CALL audio source as android has restricted it's access to system apps only so we had to use VOICE_RECOGNITION which uses the device microphone to capture audio. The speaker's voice is getting processed and transcribed perfectly but it fails to capture the other person's voice. I have created a sample repository for explaining this issue with the respective stack traces.



0 Votes 0 ·

Thanks for providing more details. I'm making some inquiries, will provide updates as soon as possible.

0 Votes 0 ·

Started an email thread with the product team, let's continue offline. Thanks.

0 Votes 0 ·

1 Answer

GiftA-MSFT avatar image
0 Votes"
GiftA-MSFT answered GiftA-MSFT commented

Hi, thanks for your feedback. You are correct, my previous response was about recognizing from microphone. The following document shows how to recognize from file. Instead of calling fromDefaultMicrophoneInput(), you call fromWavFileInput() and pass the file path. For your scenario, you'd want to consider using continuous recognition. I'm not aware of any limitations when using android 10/11 with speech sdk. If your question is specific to android 10/11 and recording audio, then maybe consider using 'dotnet-android' forum tag on this platform or post a question on android community forum for the community experts to review.

· 2
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

Thanks for the reply. The customer wants to use Android 10/11 with speech SDK.
Can we get some help from Android experts from MS? There are some permissions required for recording or streaming voice calls on Android 10. Any work around scenarios which our engineers have come across would be a good pointer.

0 Votes 0 ·
GiftA-MSFT avatar image GiftA-MSFT JyotsnaRavi-6184 ·

Thanks for your feedback. I'm making some inquiries with engineers internally. It would be helpful if you can clarify what permission issues you're facing so I can pass that information along.

0 Votes 0 ·