Deepfake-Audio

👉🏻 An audio deepfake is when a “cloned” voice that is potentially indistinguishable from the real person’s is used to produce synthetic audio.

Google Colaboratory
Kaggle
Model
Project Demo

This project is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time.

SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generate new voices.

👉🏻 Created to Learn the working of Deepfake-Audio 👈🏻

✌🏻 Back To Repository ✌🏻

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Deepfake-Audio

Files

README.md

Latest commit

History

README.md

File metadata and controls

Deepfake-Audio