Can Azure Event Hubs ingest audio data directly for real-time processing?

Muhammad Usman 25 Reputation points
2023-10-24T17:30:59.7533333+00:00

I'm exploring options for real-time data processing in Azure, and I'm wondering if Azure Event Hubs can directly ingest audio data for real-time processing. We have a call center with audio recordings, and we'd like to send this audio data to Azure for analysis and transcription as the calls are happening. Is it possible to send audio data, such as WAV files, directly to Azure Event Hubs, or should we use another approach?

I appreciate any insights or guidance on how to achieve this with Azure services. Thank you!

Azure Event Hubs
Azure Event Hubs
An Azure real-time data ingestion service.
721 questions
{count} vote

Accepted answer
  1. PRADEEPCHEEKATLA 90,661 Reputation points Moderator
    2023-10-25T06:20:49.9266667+00:00

    @Muhammad Usman - Thanks for the question and using MS Q&A platform.

    Unfortunately, it's not possible to ingest audio data directly for real-time processing using Azure Event Hubs.

    Azure Event Hubs is a cloud native data streaming service that can stream millions of events per second, with low latency, from any source to any destination. Event Hubs is compatible with Apache Kafka, and it enables you to run existing Kafka workloads without any code changes.

    Diagram that shows how Azure Event Hubs fits in an event streaming platform.

    For real-time audio processing, you may want to consider using Azure Cognitive Services, specifically the Speech Services API. The Speech Services API provides speech-to-text transcription, speaker recognition, and other audio processing capabilities that can be used to analyze audio data in real-time.

    To use the Speech Services API, you can send audio data to the API using a REST API or a client library. The API supports a variety of audio formats, including WAV, MP3, and FLAC. You can also use Azure Blob Storage or Azure Data Lake Storage to store your audio data and then use Azure Functions or Azure Logic Apps to trigger the Speech Services API for real-time processing.

    You may want to consider using Azure Cognitive Services or Azure Stream Analytics to process your audio data in real-time.

    For more details, refer to Audio processing - Speech service - Azure AI services

    Hope this helps. Do let us know if you any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.