From 3eeb78e24f97e1e1aaeda208fe4d47ea924deb40 Mon Sep 17 00:00:00 2001
From: SWivid
Date: Wed, 16 Oct 2024 13:13:36 +0800
Subject: [PATCH] Update README

---
 README.md | 11 ++++++++++-
 1 file changed, 10 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 0a2350a..7a83f47 100644
--- a/README.md
+++ b/README.md
@@ -1,16 +1,25 @@
 # F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
 
+
+
+ Watermark +
+
+
 [![python](https://img.shields.io/badge/Python-3.10-brightgreen)](https://github.com/SWivid/F5-TTS)
 [![arXiv](https://img.shields.io/badge/arXiv-2410.06885-b31b1b.svg?logo=arXiv)](https://arxiv.org/abs/2410.06885)
 [![demo](https://img.shields.io/badge/GitHub-Demo%20page-blue.svg)](https://swivid.github.io/F5-TTS/)
 [![space](https://img.shields.io/badge/🤗-Space%20demo-yellow)](https://huggingface.co/spaces/mrfakename/E2-F5-TTS)
+[![lab](https://img.shields.io/badge/X--LANCE-Lab-grey?labelColor=lightgrey)](https://x-lance.sjtu.edu.cn/)
 
 **F5-TTS**: Diffusion Transformer with ConvNeXt V2, faster trained and inference.
 
-**E2 TTS**: Flat-UNet Transformer, closest reproduction.
+**E2 TTS**: Flat-UNet Transformer, closest reproduction from [paper](https://arxiv.org/abs/2406.18009).
 
 **Sway Sampling**: Inference-time flow step sampling strategy, greatly improves performance
 
+### Thanks to all the contributors !
+
 ## Installation
 
 Clone the repository: