Opening and reading PDF files in Microsoft Edge on Windows 10
This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
Hello!
I noticed that there seems to be some text to speech engine that Microsoft Edge is using within its own immersive reader, I would like to know if it is possible to convert the entire audio of the immersive reader's text to speech into an mp3 file! It also seems to be able to be run locally, so I don't need to connect to the internet. Is there an API for this text to speech engine?
To those that are about to point me in the direction of Microsoft's Speech Services, I have already used it, and I love it! I am merely looking at other options of text to speech engines that can run locally (on my machine). I have looked into flite (CMU's text engine), mimic (AI-powered engine based off CMU's flite text engine), espeak-ng (extremely robotic, sadly), and other engines, but that is another story for another day.
Any suggestions?
Opening and reading PDF files in Microsoft Edge on Windows 10
Locked Question. This question was migrated from the Microsoft Support Community. You can vote on whether it's helpful, but you can't add comments or replies or follow the question.
Well, I just had an interesting discovery... I found the exact text to speech engine that Microsoft Edge uses. Another interesting discovery is that the immersive reader and the read aloud for PDF seems to be running off different text to speech engines.
The PDF Read-Aloud feature seems to be using Microsoft Azure's text to speech engine. What's interesting, is that the Azure text to speech engine requires the internet, but Microsoft Edge seems to be running a local copy to run the PDF Read-aloud text to speech. I wonder how to use a local copy instead of connecting to the internet, and converting it to mp3, similar to this: https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/get-started-text-to-speech?tabs=terminal&pivots=programming-language-cliBut it seems the above link requires an internet connection for the API... How does Microsoft Edge do it locally?
I do not want to have to record the text to speech clip using audacity. Rather, I want to use the same text-to-speech engine on Microsoft edge to generate the audio file for me. This will save on both computation power, and time for generating the file.
audacity + tts engine = audio
vs
tts engine’s API = audio
See if this helps, a recorder for Windows text to speech:
As mentioned, I was particularly looking for an API for the text-to-speech engine. I am well aware that obs (screensharing tool) and ffmpeg and audacity all the other open-source tools exists, and can even save the audio. However, I want it to immediately run, so that I don't have to wait for a long time before it finally finishes...
I am using the text to speech engine so that I can generate long audio clips without waiting that entire length.
For example, if I wanted to say the sentence:
The quick brown fox jumps over the lazy dog
I don't want to wait for 10 seconds of recording and another minute to export, etc. that's too much time, I wanted it to be instantaneously (or almost instant) generated, kind of like the other options I mentioned.
Hi Rubiks. I'm Greg, 10 years awarded Windows MVP, here to help you.
Have you tried Audacity? See more about that recording option for Edge Read Aloud here:
https://www.softwarert.com/record-audio-from-ed...
https://www.tenforums.com/browsers-email/187363...
https://helpdeskgeek.com/windows-10/the-best-wi...
https://free-screen-recorder.en.softonic.com/wi...
Feel free to ask back any questions. Based on the results you post back I may have other suggestions if necessary.
______________________________________________
Standard Disclaimer: There are links to non-Microsoft websites. The pages appear to be providing accurate, safe information. Watch out for ads on the sites that may advertise products frequently classified as a PUP (Potentially Unwanted Products). Thoroughly research any product advertised on the sites before you decide to download and install it.