Any available resources for developing a live recording and transcript app?

AdamSardo-9959 0 Reputation points
2024-02-26T13:27:09.6466667+00:00

I'm new to development but I can work with templates and examples. Are there any resources available for creating a live audio recording and transcript app? I want to implement this feature into my service offering and have checked Azure GitHub but couldn't find anything there. Thank you for your help!

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,664 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,895 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,772 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. Charlie Wei 3,320 Reputation points
    2024-02-26T14:06:13.68+00:00

    Hello AdamSardo-9959,

    If you are a beginner developer who can learn by examples, I believe starting with the speech-to-text sample from the Azure OpenAI Whisper model on Microsoft Learn is a good starting point.

    Best regards,
    Charlie


    If you find my response helpful, please consider accepting this answer and voting 'yes' to support the community. Thank you!


  2. YutongTie-MSFT 50,811 Reputation points
    2024-02-26T18:21:01.7866667+00:00

    @AdamSardo-9959 Thanks for reaching out to us, Whisper Model is a good solution, but you need the access for Azure OpenAI first.

    I would suggest you trying the Real-time speech to text feature if you want to do a live transcribe.

    Please see document here - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-to-text#real-time-speech-to-text

    QuickStart - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-python

    GitHub samples are here - https://github.com/Azure-Samples/cognitive-services-speech-sdk

    If you want to leverage Whisper model, please make sure you apply for the access here - Currently, you must submit an application to access Azure OpenAI Service. To apply for access, complete this form.

    I hope this helps, please have a try and let us know how it works.

    Regards, Yutong

    -Please kindly accept the answer if you feel helpful to support the community, thanks a lot.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.