Cantonese support? #172

Open
Naozumi520 opened this issue Nov 14, 2023 · 4 comments
Comments

@Naozumi520

Naozumi520 commented Nov 14, 2023

Is your feature request related to a problem? Please describe.
I'm always frustrated that Cantonese, a Chinese dialect, doesn't get enough attention. As a Hongkonger, I speak Cantonese every day, but there are not many resources for it. I know VITS_Chinese supports Cantonese, but the results are not very good. I would like to know whether wetts can support this dialect, which has 85.5 million speakers.

Describe the solution you'd like
A TTS system that supports Cantonese, also with BERT.

Describe alternatives you've considered
There are no real alternatives: VITS_Chinese and PaddleSpeech are the only ones. However, as I said above, the results are not very good.

Additional context
https://huggingface.co/indiejoseph/bert-base-cantonese
https://github.com/yeyupiaoling/VITS-Pytorch/blob/master/mvits/text/cantonese.py
PaddlePaddle/PaddleSpeech#2669

@pengzhendong
Member

pengzhendong commented Nov 20, 2023

I am Cantonese. I will focus on it. Training data is the biggest problem right now.

@Naozumi520
Author

Naozumi520 commented Nov 20, 2023

Ah yes... Can we use Common Voice? I used sovits to convert the dataset into a single voice for VITS.

@Naozumi520
Author

It sounds okay, but without BERT it's not very natural.

1700493557_._nozomiCantonese.mov

@pengzhendong
Member

A g2p for mixed Cantonese-English text: https://github.com/pengzhendong/g2p-mix
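
For readers unfamiliar with the problem, here is a rough sketch of the kind of preprocessing a mixed-language g2p front end typically needs: splitting code-switched text into per-script runs so that each run can go to a language-specific g2p. This is not the g2p-mix API. The function name `split_by_script` is hypothetical, and treating the CJK Unified Ideographs block (U+4E00 to U+9FFF) as "Cantonese" and ASCII letters as "English" is a simplifying assumption.

```python
import re

# Assumption: characters in U+4E00..U+9FFF are treated as Cantonese text and
# ASCII letters (with an optional apostrophe part, e.g. "don't") as English.
# Anything else (digits, punctuation, whitespace) is skipped.
_RUNS = re.compile(r"(?P<zh>[\u4e00-\u9fff]+)|(?P<en>[A-Za-z]+(?:'[A-Za-z]+)?)")


def split_by_script(text):
    """Return a list of (tag, chunk) pairs, where tag is 'zh' or 'en'."""
    # m.lastgroup is the name of the alternative that matched.
    return [(m.lastgroup, m.group()) for m in _RUNS.finditer(text)]


if __name__ == "__main__":
    for tag, chunk in split_by_script("我今日去咗office開會"):
        print(tag, chunk)
```

Each `zh` chunk would then go to a Cantonese g2p (for example, one producing Jyutping) and each `en` chunk to an English g2p; that routing step is left out here because it depends on the libraries chosen.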
