How to start using VAD? #198

jhoelzl · 2016-10-11T12:12:45Z

Hello,

i am interested in the performance of the VAD in this project. I generated the docu files and found the page about "Building a voice activity detector (VAD)".

I went to the path alex/tools/vad and tried to run some scripts, however, i have no training data. In the documentation these folders are described:

data_vad_sil # a directory with only silence, noise data and its mlf file
data_voip_cs # a directory where CS data reside and its MLF (phoneme alignment)
data_voip_en # a directory where EN data reside and its MLF (phoneme alignment)
model_voip # a directory where all the resulting models are stored.

I suppose i have to make these folders in alex/tools/vad ? But where can i find/download the appropriate audio and mlf files?

Regards,
Josef

The text was updated successfully, but these errors were encountered:

jurcicek · 2016-10-11T12:49:16Z

Unfortunately, we do not have such data publicly available. You have to
have your own.

Best regards,
Filip Jurcicek

Work tel. (CZ): +420221914402
Personal tel. (CZ): +420777805048
Skype: bozskyfilip

http://ufal.mff.cuni.cz/filip-jurcicek

On 11 October 2016 at 14:12, Josef Hölzl [email protected] wrote:

Hello,

i am interested in the performance of the VAD in this project. I generated
the docu files and found the page about "Building a voice activity detector
(VAD)".

I went to the path alex/tools/vad and tried to run some scripts, however,
i have no training data. In the documentation these folders are described:

data_vad_sil # a directory with only silence, noise data and its mlf file
data_voip_cs # a directory where CS data reside and its MLF (phoneme
alignment)
data_voip_en # a directory where EN data reside and its MLF (phoneme
alignment)
model_voip # a directory where all the resulting models are stored.

I suppose i have to make these folders in alex/tools/vad ? But where can
i find/download the appropriate audio and mlf files?

Regards,
Josef

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
#198, or mute the thread
https://github.com/notifications/unsubscribe-auth/AEmNUW90OKcog8iaTjRkf887gMog1NN9ks5qy309gaJpZM4KTjsv
.

jhoelzl · 2016-10-11T13:53:57Z

Hello @jurcicek, thanks for reply.

I realized that i have to generate the mlf files using Kaldi or HTK.
Therefore i started the script train_voip_en.sh in alex/tools/kaldi/data_voip_en/ and also set my variable KALDI_ROOT.

Then i added some required directories to model_voip_en and data_voip_en, like dev, test,train.
When i run the script train_voip_en.sh i get some empty files:

model_voip_en
└───local
│   └───dev
│       │   spk2gender
│       │   spk2utt
│       │   trans.txt
│       │   utt2spk
│       │   wav.scp
│   └───test
    │   ...
│   └───train
    │   ...

and therefore the script stops:

Initializing set 'dev' output files
Initializing set 'test' output files
Initializing set 'train' output files
--- Distributing the file lists to train and (dev test x build0 build2) directories ...
utils/validate_data_dir.sh: empty file spk2utt

Can you tell me what is all about these files?

jurcicek · 2016-10-11T15:15:49Z

You need train, dev, test audio and their transcriptions. Given this you
can train an acoustic model, which will force align the transcriptions and
generate the MLF file. Then you can train VAD.

DISCLAIMER: This part of the code was used about 3 years ago for the last
time. It may not even work.

Best regards,
Filip

Personal tel. (CZ): +420777805048
Skype: bozskyfilip

On 11 October 2016 at 15:53, Josef Hölzl [email protected] wrote:

Hello @jurcicek https://github.com/jurcicek, thanks for reply.

I realized that i have to generate the mlf files using Kaldi or HTK.
Therefore i started the script train_voip_en.sh in
alex/tools/kaldi/data_voip_en/ and also set my variable KALDI_ROOT.

Then i added some required directories to model_voip_en and data_voip_en,
like dev, test,train.
When i run the script train_voip_en.sh i get some empty files:

model_voip_en
└───local
│ └───dev
│ │ spk2gender
│ │ spk2utt
│ │ trans.txt
│ │ utt2spk
│ │ wav.scp
│ └───test
│ ...
│ └───train
│ ...

and therefore the script stops:

Initializing set 'dev' output files
Initializing set 'test' output files
Initializing set 'train' output files
--- Distributing the file lists to train and (dev test x build0 build2)
directories ...
utils/validate_data_dir.sh: empty file spk2utt

Can you tell me what is all about these files?

—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
#198 (comment), or mute
the thread
https://github.com/notifications/unsubscribe-auth/AEmNUS0IR3CAQD3hGt_5EaCvsChpv_Hpks5qy5T2gaJpZM4KTjsv
.

cappelll · 2016-12-20T11:10:25Z

Hi Filip,

say I've trained a VAD model, is there already a script that outputs the VAD results, using as input the model file and an audio?

Thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to start using VAD? #198

How to start using VAD? #198

jhoelzl commented Oct 11, 2016

jurcicek commented Oct 11, 2016

jhoelzl commented Oct 11, 2016

jurcicek commented Oct 11, 2016

cappelll commented Dec 20, 2016

How to start using VAD? #198

How to start using VAD? #198

Comments

jhoelzl commented Oct 11, 2016

jurcicek commented Oct 11, 2016

jhoelzl commented Oct 11, 2016

jurcicek commented Oct 11, 2016

cappelll commented Dec 20, 2016