Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Could we have web speech API support IPA for speech synthesis voice language #109

Open
Thaina opened this issue Mar 2, 2023 · 2 comments

Comments

@Thaina
Copy link

Thaina commented Mar 2, 2023

Instead of language specific voice, we could share all voice for many language with IPA specification. We can translate word from many language into string of IPA and let voice synthesis read out from the same shared voice model. Because many language shared the same pronunciation of vocab, it should decrease overall voice model data

We should also have API for IPA dictionary per language and so we could convert text that may contains many language into IPA and synthesize the voice to play it with one sentence

@LeaVerou
Copy link

LeaVerou commented Dec 26, 2023

Came here to suggest this as well. IPA offers an escape hatch for use cases that need a higher level of control.

My use case: I’ve been playing around with text-to-speech for a toddler reading app, and finding the language specification too inconsistent. E.g. I want pressing a letter to make the sound the letter makes, but that seems impossible, at least in Greek. Instead the letter name is spoken. Furthermore, even for entire syllables, I've seen platforms just pronounce the letter names one after another, whereas others speak the actual syllable.

Since most text-to-speech systems are trained on samples from real languages, it could still take a language as a parameter. That would even enable use cases like "Speak English with a French accent" (by converting the English to IPA and setting the language to French).

@jankapunkt
Copy link

Why not just allow to load custom trained voices?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants