DEV Community

Jochem Stoel
Jochem Stoel

Posted on

3 1

Explain to me this big difference between SAPI5 and Google speech synthesizer voices like I am five.

Traditional Windows SAPI5 speech synthesis voices are language specific. The voice is designed/recorded to be a specific language. This means that if you feed English text to a French voice it will read and pronounce it as if it were French, making it sound idiotic.

Google has a speech synthesis service as well, available in Chrome browser and as API and it behaves a lot differently. If you feed English text to a Dutch Google voice, it speaks/pronounces it properly but it gets a strong Dutch accent. This is not possible with traditional text to speech.

Please describe both processes and explain to me like I am 5 the fundamental difference between them that is responsible for this.

Thanks!

Sentry image

See why 4M developers consider Sentry, “not bad.”

Fixing code doesn’t have to be the worst part of your day. Learn how Sentry can help.

Learn more

Top comments (0)

A Workflow Copilot. Tailored to You.

Pieces.app image

Our desktop app, with its intelligent copilot, streamlines coding by generating snippets, extracting code from screenshots, and accelerating problem-solving.

Read the docs

👋 Kindness is contagious

Please leave a ❤️ or a friendly comment on this post if you found it helpful!

Okay