-
Notifications
You must be signed in to change notification settings - Fork 66
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Details on TTS evaluation? #42
Comments
Please refer to Appendix C. |
Hi, I have a few questions about zero-shot TTS evaluation using the VCTK dataset. 1. Evaluation Methodology:
In the paper, particularly in Appendix C, the evaluation process seems a bit open to interpretation. Could you please provide a detailed description of how the evaluation was conducted? 2. Dataset Usage: |
Hi, I'm currently trying to reproduce your results on the TTS task, too. My performance on the VCTK 0.92 dataset yielded a WER of around 27 and a speaker similarity score of approximately 71. Here’s a summary of my reproduction setup:
Could you please provide more details regarding the evaluation process for the TTS task? Thank you! |
Hello! Thanks for your wonderful work. Trying to reproduce your results on the TTS task, I'm wondering if you could provide more details about the evaluation of the TTS task, especially:
Thanks!
The text was updated successfully, but these errors were encountered: