Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
cantabile-kwok committed Apr 18, 2024
1 parent 12f2e0b commit 20a066e
Show file tree
Hide file tree
Showing 9 changed files with 93,198 additions and 26,971 deletions.
24 changes: 18 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ StoryTTS is a highly expressive text-to-speech dataset that contains rich expres

## Download

* [Google Drive (5.06 GB)](https://drive.google.com/file/d/1KuD-6c2yxLqPhNJHCaE1jIee-8TjjsB6/view?usp=drive_link)
* Please download the speech data from [Google Drive (5.06 GB)](https://drive.google.com/file/d/1KuD-6c2yxLqPhNJHCaE1jIee-8TjjsB6/view?usp=drive_link)

### Note

Expand All @@ -24,11 +24,23 @@ StoryTTS is a highly expressive text-to-speech dataset that contains rich expres

## File Description

* `text/transcript` : The transcripts of StoryTTS
* `text/label` : The annotation labels of StoryTTS
* `text/prompt_claude2`: Prompt and instruction for Claude2
* `text/prompt_gpt4`: Prompt and instruction for GPT4
* `text/wav.scp`: Path of wav files
* `dataset/transcript` : The transcripts of StoryTTS in simplified Chinese with puncuations.

* `dataset/utt2dur`: The duration (in seconds) of each utterance.

* `dataset/utt2spk`: The speaker name of each utterance, i.e. the name of the only speaker in StoryTTS.

* `dataset/label` : The annotation labels of StoryTTS. Note: currently labels for some utterances are missing, and will be complemented very soon. The format of this file is as follows:

```
utt-ID 句式(Sentence Pattern)|修辞手法(Rhetoric Device)|场景(Scene)|情感色彩(Emotional colors)|模仿人物(Imitated Characters)
```

* `dataset/prompt_claude2`: Prompt and instruction for Claude2.

* `dataset/prompt_gpt4`: Prompt and instruction for GPT4.

* `dataset/wav.scp`: Path of wav files. Note: might be changed according to your location of storing the speech data.

## Citation

Expand Down
26,963 changes: 26,963 additions & 0 deletions dataset/label

Large diffs are not rendered by default.

File renamed without changes.
File renamed without changes.
2 changes: 1 addition & 1 deletion text/transcript → dataset/transcript
Original file line number Diff line number Diff line change
Expand Up @@ -33105,4 +33105,4 @@ LianLiru-ZSDFS-episode160-seg234 东方朔把自己的一袋土倒下,卫青也
LianLiru-ZSDFS-episode160-seg235 卫青流着眼泪说,去病啊,你先走一步,舅舅早晚要随你而去呀!武帝听了这句话,大吃一惊!
LianLiru-ZSDFS-episode160-seg236 霍去病走了
LianLiru-ZSDFS-episode160-seg237 卫青大将军呢
LianLiru-ZSDFS-episode160-seg238 智圣东方朔的故事还没有讲完,请继续收听铁血卷天怒
LianLiru-ZSDFS-episode160-seg238 智圣东方朔的故事还没有讲完,请继续收听铁血卷天怒
Loading

0 comments on commit 20a066e

Please sign in to comment.