How can I synthesize Mandarin with the 粤语女 (Cantonese female) voice? #387

Open
16dian11 opened this issue Sep 12, 2024 · 2 comments

Comments

@16dian11

No description provided.

@aluminumbox
Collaborator

aluminumbox commented Sep 12, 2024

This is not very stable; the model is more likely to speak Cantonese.

@16dian11
Author

16dian11 commented Sep 13, 2024

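Thanks for the reply.
Yes, that is the part that confuses me. The BPE tokens for Cantonese look no different from those for Mandarin, yet the pronunciation is clearly different. In your training data, did you prepend <|yue|> to the Cantonese text?
If I want to add another dialect that is also written in Chinese characters, such as Sichuanese, should I prepend a similar tag such as <|chuan|> to the training text and then modify the LANGUAGES dictionary in whisper/tokenizer.py?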
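
For readers following along, the proposal in this question could be sketched roughly as below. This is only an illustration of the idea being asked about, not a confirmed description of how the model was trained. The `LANGUAGES` dict here is a small local stand-in for the one in whisper/tokenizer.py, the `<|code|>` tag format is assumed, and the helper names `language_tag` and `tag_text`, the code `chuan`, and the name `sichuanese` are hypothetical.

```python
# Local stand-in for the LANGUAGES dict in whisper/tokenizer.py (code -> name).
# Only a few entries are reproduced here for illustration.
LANGUAGES = {
    "en": "english",
    "zh": "chinese",
    "yue": "cantonese",
}


def language_tag(code: str) -> str:
    """Format a known language code as a special token such as <|yue|>."""
    if code not in LANGUAGES:
        raise KeyError(f"unknown language code: {code}")
    return f"<|{code}|>"


def tag_text(code: str, text: str) -> str:
    """Prepend the language tag to a training transcript (hypothetical helper)."""
    return language_tag(code) + text


# Hypothetical new entry for Sichuanese, using the code proposed in the question.
LANGUAGES["chuan"] = "sichuanese"

print(tag_text("yue", "你好"))
print(tag_text("chuan", "你好"))
```

Whether registering the tag and prepending it to training text is enough for the model to learn a new dialect is exactly what the question asks, and it is not answered in this thread.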
