How to Use "Real-time speech to text" feature of Azure Cognitive Speech Service in ServiceNow?

Question

How to Use "Real-time speech to text" feature of Azure Cognitive Speech Service in ServiceNow?

Carmel Franco Raj 0

If any of you has coded or know about this, please let me know

2 answers

Your answer

Answer 1

Deleted

This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.

Comments have been turned off. Learn more

Answer 2

Jerald Felix 1,630

Hello Carmel,

There's no native ServiceNow plugin for Azure Speech-to-Text, you can implement it by using a custom UI widget + Azure WebSocket API or an external integration using MID server / integration hub / REST messages.

Custom UI Component with WebSocket Client

Use a Service Portal widget or Now Experience UI Framework to connect to Azure Speech.

Steps:

Create Azure Speech Resource

Go to Azure portal > Create Speech resource.

  Copy the key and region.
  
  **Enable WebSocket in browser (via JavaScript)**
  
     Use browser mic (`getUserMedia`)
     
        Stream audio via **Azure STT WebSocket endpoint**
        
           Parse and display the result in real-time
           
           **Embed this in a ServiceNow Widget or UI page**

GitHub sample JS client: https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/quickstart/javascript/browser

Option 2: Node.js Backend + ServiceNow Integration

If WebSocket handling in ServiceNow frontend is too complex:

Build a Node.js Express backend that:

Uses Azure Speech SDK (npm microsoft-cognitiveservices-speech-sdk)

  Accepts audio input (from browser or app)
  
     Returns transcript
     
     **Expose this backend as REST API**
     
     In **ServiceNow**, create:
     
        REST Message to call backend
        
           Scripted UI to handle audio upload and fetch transcription
           
              Use transcription in Incident/Case/Record creation

Option 3: Integration Hub + Azure REST API (Not Real-Time)

Use batch transcription with Integration Hub or REST Message.

Less real-time, but easier to set up.

You post audio blob → Azure returns transcript.

Azure Batch STT Docs

Best Regards,

Jerald Felix

Carmel Franco Raj 0 Reputation points

2025-05-19T18:14:32.19+00:00

Hi Jerald Felix,
Thank you for your response.

I am simply trying to Convert the text to Speech and Speech to Text
with a Demo Catalog item:

as shown here :

ref catalog item update set : sys_remote_update_set_c376f22ec3e16e10e193b61ed4013176.xml

then to use the Azure API on the portal side, I created 2 Widgets (XML export of update sets):
sp_widget (sys_updated_onON2025-05-19@javascript_gs.dateGenerate('2025-05-19','start')@javascrip.xml
then to call Azure APIs I created this script include
sys_script_include_3a350222c3216e10e193b61ed4013127.xml

how ever it's giving me blank text when I try to convert speech to text( somewhere it's not sending my actual recording) appreciate if you can find and let me that.

When I try to convert text to speech and try to attach it to the portal attachment (paperclip) it's not showing the response from server appreciate if you can find and let me that.
Ravada Shivaprasad 535 Reputation points Microsoft External Staff Moderator

2025-05-23T22:36:48.2766667+00:00

Hi Carmel Franco Raj

Your issue with speech-to-text appears to stem from the audio input not being transmitted correctly to Azure. Ensure the recording is in PCM WAV format (16-bit, 16kHz) and that your API key and endpoint are correctly configured. Additionally, log the audio data before sending it to Azure to verify if the input is valid. If the response is still blank, testing with the Azure Speech SDK can confirm whether the issue is within ServiceNow or Azure.

For text-to-speech, the missing response in attachments may be due to improper handling of binary data. Verify that Azure is returning a valid audio file and that the format is compatible with ServiceNow (MP3, WAV, OGG). Use the GlideSysAttachment API to correctly store the file. If attachments still fail to appear, check whether the file content type is correctly set before uploading.

Reference : Azure Speech Service Troubleshooting Guide , Service Now Attachment Handling Documentation
Let me know if you need script refinements.
Thanks
Ravada Shivaprasad 535 Reputation points Microsoft External Staff Moderator

2025-05-26T23:00:41.5933333+00:00

Hi Carmel Franco Raj

Just checking in to see if the above answer helped. If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Thanks
Ravada Shivaprasad 535 Reputation points Microsoft External Staff Moderator

2025-05-27T23:06:56.24+00:00

Hi Carmel Franco Raj

Following up to see if the above answer was helpful. If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Thanks

Share via

How to Use "Real-time speech to text" feature of Azure Cognitive Speech Service in ServiceNow?

2 answers

Your answer