Question 1

Why is the voice list empty when I first open the tool?

Accepted Answer

Voice loading is asynchronous in most browsers. In Chrome in particular, speechSynthesis.getVoices() returns an empty list until the voiceschanged event fires, usually a fraction of a second after page load. The 'Loading voices...' placeholder waits for that event and is replaced by the dropdown once the list is ready. If voices never appear, your browser may have speech synthesis disabled, or your platform may have no TTS engines installed.
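The wait-for-voices pattern described above can be sketched roughly as follows. This is not the tool's actual code: the names VoiceSource and onVoicesReady are made up for illustration, and the small interface only mirrors the parts of speechSynthesis the sketch needs, so it can also be exercised outside a browser.

```typescript
// Hypothetical minimal shape of the parts of speechSynthesis used below.
// In a browser, window.speechSynthesis is expected to fit this shape.
interface VoiceSource<V> {
  getVoices(): V[];
  addEventListener(type: string, listener: () => void): void;
  removeEventListener(type: string, listener: () => void): void;
}

// Calls `ready` once with a non-empty voice list: immediately if the list is
// already populated, otherwise after a voiceschanged event that delivers voices.
function onVoicesReady<V>(
  source: VoiceSource<V>,
  ready: (voices: V[]) => void
): void {
  const initial = source.getVoices();
  if (initial.length > 0) {
    ready(initial);
    return;
  }
  const handler = (): void => {
    const voices = source.getVoices();
    if (voices.length === 0) {
      return; // Still empty: keep showing the placeholder and keep waiting.
    }
    source.removeEventListener("voiceschanged", handler);
    ready(voices);
  };
  source.addEventListener("voiceschanged", handler);
}

// Assumed browser usage (populateDropdown is a hypothetical helper):
//   onVoicesReady(window.speechSynthesis, (voices) => populateDropdown(voices));
```

If the callback never runs, the page is in the "voices never appear" case, so a real implementation would probably pair this with a timeout that swaps the placeholder for an explanatory message.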

Question 2

Can I download the audio as an MP3?

Accepted Answer

Not from the Web Speech API directly. The API plays speech through your speakers but does not expose an audio stream that a page can record. To capture the output, use your operating system's audio recorder pointed at the system audio device, or make a screen recording with audio. If you need an audio file, a dedicated TTS API (ElevenLabs, OpenAI TTS, Amazon Polly) is the right tool.

Question 3

How long can the input text be?

Accepted Answer

There is no formal cap, but most browsers stop or stutter around 32,000 characters in a single utterance. For long documents, split the text into paragraphs and play the sections individually. Reading speed at 1.0x is roughly 250 words per minute, though the actual pace varies by voice and engine, so a 1,000-word piece takes about four minutes to read aloud (1,000 / 250 = 4).
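As a rough illustration of the splitting advice and the reading-time arithmetic, here is a minimal sketch. The function names, the 2,000-character default chunk size, and the sentence-splitting regular expression are assumptions chosen for the example, not part of the tool; only the 250 words-per-minute figure comes from the answer above.

```typescript
// Splits text into chunks of at most maxChars characters, preferring paragraph
// boundaries, then sentence boundaries, and cutting mid-sentence only as a
// last resort. maxChars = 2000 is an arbitrary illustrative default.
function splitForSpeech(text: string, maxChars: number = 2000): string[] {
  const chunks: string[] = [];
  const paragraphs = text
    .split(/\n\s*\n/)
    .map((p) => p.trim())
    .filter((p) => p.length > 0);

  for (const paragraph of paragraphs) {
    if (paragraph.length <= maxChars) {
      chunks.push(paragraph);
      continue;
    }
    // Oversized paragraph: pack whole sentences into chunks up to maxChars.
    const sentences =
      paragraph.match(/[^.!?]*[.!?]+\s*|[^.!?]+$/g) ?? [paragraph];
    let current = "";
    for (const sentence of sentences) {
      const candidate = (current + sentence).trimEnd();
      if (current.length > 0 && candidate.length > maxChars) {
        chunks.push(current.trim());
        current = "";
      }
      current += sentence;
      // A single sentence longer than maxChars is cut at the limit.
      while (current.length > maxChars) {
        chunks.push(current.slice(0, maxChars).trim());
        current = current.slice(maxChars);
      }
    }
    if (current.trim().length > 0) {
      chunks.push(current.trim());
    }
  }
  return chunks;
}

// Estimated minutes to read text aloud at 1.0x, using the rough
// 250 words-per-minute figure quoted above as the default.
function estimateMinutes(text: string, wordsPerMinute: number = 250): number {
  const words = text.trim().split(/\s+/).filter((w) => w.length > 0).length;
  return words / wordsPerMinute;
}
```

Each chunk could then be handed to its own utterance and played in order. A smaller chunk size trades longer pauses between sections for a lower risk of the browser cutting off mid-utterance.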

Question 4

Does it work offline?

Accepted Answer

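Yes, once the page has loaded, for any voice installed on your device. Synthesis for those voices runs locally, using whatever TTS engines your operating system provides. Some browsers also list network-backed voices (Chrome's Google voices, for example), and those need an internet connection. The reliance on local engines is also why voices differ between Windows, macOS, Linux and mobile: each platform ships its own engines, and the API exposes whatever is locally installed.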
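To check which voices will actually work without a connection, each voice object exposes a localService flag. The sketch below separates on-device voices from network-backed ones. VoiceInfo and partitionVoices are illustrative names, and the interface copies only three fields of the browser's voice object so the example can run outside a browser.

```typescript
// Hypothetical subset of the browser's SpeechSynthesisVoice fields.
interface VoiceInfo {
  name: string;
  lang: string;
  localService: boolean; // true when the voice is synthesized on the device
}

// Separates voices that should work offline from those that need a network.
function partitionVoices<V extends VoiceInfo>(
  voices: V[]
): { local: V[]; remote: V[] } {
  const local: V[] = [];
  const remote: V[] = [];
  for (const voice of voices) {
    if (voice.localService) {
      local.push(voice);
    } else {
      remote.push(voice);
    }
  }
  return { local, remote };
}

// Assumed browser usage:
//   const { local } = partitionVoices(window.speechSynthesis.getVoices());
//   // offer only `local` voices when navigator.onLine is false
```

The list returned in a browser depends on the platform, so the same page may show a handful of local voices on one machine and dozens on another.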

Text to Speech Previewer
