speech-embeddings

The Dis-Vector project enhances voice conversion and synthesis through disentangled embeddings, allowing for high-quality, zero-shot voice cloning across multiple languages. This model leverages separate encoders for content, pitch, rhythm, and timbre, enabling precise control over synthesized voice characteristics.

voice-conversion zero-shot-learning few-shot low-resource-languages voice-cloning speech-embeddings

Updated Sep 20, 2024
Python

epistoteles / predicting-speaker-quality

Star

This repository belongs to my Bachelor's thesis on predicting voice likability from pre-trained speech embeddings.

speech-processing speech-quality speech-embeddings speech-likability

Updated May 18, 2021
Python

Improve this page

Add a description, image, and links to the speech-embeddings topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-embeddings topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-embeddings

Here are 6 public repositories matching this topic...

usc-sail / gen-dmcca

jvel07 / wav2vec2_patho

jvel07 / dnn_embeddings_pytorch

peter-yh-wu / cross-lingual

NN-Project-1 / dis-Vector-Embedding

epistoteles / predicting-speaker-quality

Improve this page

Add this topic to your repo