Is it possible to hard force stress when training xtts2 model #28

brambox · 2024-05-25T10:46:15Z

It seems that no matter how much you train the model, it will very often put the stress in a different position. Is there any way, using the vocab or something similar, to force the stress position so that it is always accurate, rather than leaving it to chance and letting the model put the stress wherever it decides?

eginhard · 2024-05-25T13:45:25Z

Maintainer

The old-school way would be to train a model on phonemes with stress labels, but of course XTTS is not even trained on phonemes. Since the model doesn't directly receive that information, there is no way to force a specific stress and you can only hope it guesses correctly from the context when it's trained on more data.

At inference time, you could try something like capitalising the syllable that should be stressed. During training, you could try to steer the model in that direction by capitalising the stressed syllables there as well.
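To make the capitalisation idea concrete, here is a minimal sketch of marking the stressed syllable in a word before it goes into the training transcripts or the inference text. The helper `mark_stress` and the manual syllable split are hypothetical and not part of XTTS or any library; whether the model actually learns anything from this marking is untested.

```python
from typing import Sequence


def mark_stress(syllables: Sequence[str], stressed_index: int) -> str:
    """Join syllables into one word, upper-casing only the stressed syllable.

    Hypothetical helper for illustration; the syllable split is supplied by hand.
    """
    if not 0 <= stressed_index < len(syllables):
        raise ValueError("stressed_index out of range")
    return "".join(
        syl.upper() if i == stressed_index else syl.lower()
        for i, syl in enumerate(syllables)
    )


print(mark_stress(["pho", "to", "graph"], 0))       # PHOtograph
print(mark_stress(["pho", "to", "gra", "phy"], 1))  # phoTOgraphy
```

The same marking would have to be applied consistently to both the training data and the inference input for the model to have any chance of picking it up.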

brambox · 2024-05-25T16:12:52Z

Author

Thank you! Don't the cleaners automatically convert everything to lowercase at inference?

1 reply

eginhard May 26, 2024
Maintainer

Right, then this won't work out of the box either.
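A quick way to see the problem is to check whether a marked word is still distinguishable from the unmarked one after cleaning. The sketch below uses a stand-in `lowercase_cleaner`; it is an assumption for illustration and not the actual XTTS cleaner code, and `marking_survives` is a hypothetical helper.

```python
from typing import Callable


def lowercase_cleaner(text: str) -> str:
    """Hypothetical stand-in for a cleaner that lowercases its input."""
    return text.lower()


def marking_survives(marked: str, plain: str, cleaner: Callable[[str], str]) -> bool:
    """Return True if the cleaned marked text still differs from the cleaned plain text."""
    return cleaner(marked) != cleaner(plain)


# With a lowercasing cleaner the capitalised syllable is erased.
print(marking_survives("phoTOgraphy", "photography", lowercase_cleaner))  # False

# With a cleaner that leaves case alone (here just str.strip) it is preserved.
print(marking_survives("phoTOgraphy", "photography", str.strip))  # True
```

If the check returns False for the cleaner in use, the capitalisation never reaches the model, so the cleaning step would need to be changed for both training and inference before this approach could be tried.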
