How can I add an audio dataset for Bashkir language? #85

Open
AigizK opened this issue Nov 17, 2022 · 2 comments
Labels
New Language New Language that isn't yet supported

Comments

@AigizK

AigizK commented Nov 17, 2022

I have an audio dataset (audio and transcript in Cyrillic) with a female voice in the Bashkir language. What should I do to get you to support our language as well?

@NeonClary
Member

Hello @AigizK,
I have added Bashkir to the list of languages we're planning for our next group. A dataset is exactly what we'll need. Where is your dataset at the moment? We need to look at it and listen to it to see if it will work well with our existing models. I do think adding Bashkir will be simpler than some other languages, since we have already included other languages that use the Cyrillic alphabet.

Our team has discussed what's most efficient for our resources, and we'd like to do several language requests at once. We plan to do it after finishing the project our STT/TTS team is working on right now, and before starting the next one. That shouldn't be long. You are welcome to email me directly at [email protected] if that's a better way to show us your dataset.

@AigizK
Author

AigizK commented Nov 24, 2022

@NeonClary thank you, I sent an email with a link to the dataset.

@NeonDaniel NeonDaniel added the New Language New Language that isn't yet supported label Nov 28, 2022
@NeonBohdan NeonBohdan changed the title from "How can I add an audio dataset for my language?" to "How can I add an audio dataset for Bashkir language?" Mar 2, 2023
@NeonDaniel NeonDaniel moved this to Low Priority in Community Projects May 16, 2024
Projects
Status: Low Priority