Build custom non-English dataset with ARPABET #34

Open
Opdoop opened this issue Apr 1, 2022 · 2 comments

Comments

Opdoop commented Apr 1, 2022

Hi, thanks for open-sourcing this project. I'm new to VC and am trying to add a new speaker to assem-vc.
In the Prepare Metadata section, @wookladin uses python datasets/g2p.py to convert transcriptions into ARPABET.
For a custom non-English dataset, e.g. Mandarin Chinese, how should the metadata be built?
I searched for g2p tools and found https://github.com/kakaobrain/g2pM, a grapheme-to-phoneme conversion tool for Chinese. But its output is Pinyin, not ARPABET, which confuses me. Can Pinyin be used to build the metadata for Chinese?
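
To make the question concrete, here is a rough sketch of what I mean by using Pinyin as the phoneme set: split each tone-numbered syllable into an initial and a final, then join the tokens the way ARPABET tokens are usually written. The helper names (`split_syllable`, `to_phoneme_string`) and the curly-brace wrapping are my own assumptions, not something taken from assem-vc, and the model's symbol table would presumably have to be extended before it could read these tokens.

```python
# Hypothetical helper, not part of assem-vc or g2pM.
# Pinyin initials; two-letter ones come first so "zh" matches before "z".
INITIALS = ["zh", "ch", "sh", "b", "p", "m", "f", "d", "t", "n", "l",
            "g", "k", "h", "j", "q", "x", "r", "z", "c", "s", "y", "w"]


def split_syllable(syllable):
    """Split a tone-numbered syllable such as 'zhong1' into [initial, final+tone]."""
    tone = ""
    if syllable and syllable[-1].isdigit():
        syllable, tone = syllable[:-1], syllable[-1]
    for initial in INITIALS:
        # Require a non-empty final after the initial.
        if syllable.startswith(initial) and len(syllable) > len(initial):
            return [initial, syllable[len(initial):] + tone]
    # Syllables without an initial (e.g. 'an4') stay as one token.
    return [syllable + tone]


def to_phoneme_string(syllables):
    """Join the tokens of all syllables; the braces are an assumed convention."""
    tokens = []
    for syllable in syllables:
        tokens.extend(split_syllable(syllable))
    return "{" + " ".join(tokens) + "}"


print(to_phoneme_string(["ni3", "hao3"]))
```

The input list would come from a Pinyin converter such as g2pM. Keeping the tone digit on the final mirrors how ARPABET attaches stress digits to vowels, but that is only a design guess on my side.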

Opdoop commented Apr 1, 2022

I found a phonetic notation called the International Phonetic Alphabet (IPA), which supports grapheme-to-phoneme conversion for 100+ languages. Maybe IPA could serve as the phoneme set for multilingual VC? I'll try it and see how it performs.

Opdoop commented Apr 2, 2022

No, I won't pursue this approach after all. I just tried the any-to-many example with the provided checkpoints on my own English source wav. The model drops the high-frequency part of the female voice, and the naturalness of the audio is worse than the other baseline examples on the demo page. The results are disappointing, and I expect cross-lingual VC to be even worse. I don't get why changing timbre is that difficult. 😢
