METRO: Multi-channel ExTension of pRe-trained mOdel implementation based on WeSpeaker

This repository currently provides an implementation of a multi-channel extension of WavLM Base+. Please refer to examples/multisv for an example with the MultiSV dataset.

Installation

Clone this repo

git clone https://github.com/BUTSpeechFIT/Wespeaker_MC_SSL.git

Create a conda environment (PyTorch >= 1.10.0 is required)

conda create -n wespeaker python=3.9
conda activate wespeaker
conda install pytorch=1.12.1 torchaudio=0.12.1 cudatoolkit=11.3 -c pytorch -c conda-forge
pip install -r requirements.txt
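
After the commands above finish, it can be useful to confirm that the active environment actually satisfies the stated PyTorch >= 1.10.0 requirement. The following is a minimal sketch and not part of the repository: the helper names `version_tuple`, `meets_minimum`, and `check_environment` are hypothetical and introduced here for illustration only. Versions are compared numerically because a plain string comparison would wrongly rank "1.9.0" above "1.10.0".

```python
import re

MIN_TORCH = "1.10.0"  # minimum version stated in the installation note


def version_tuple(version: str) -> tuple:
    """Turn a string such as '1.12.1+cu113' into (1, 12, 1).

    The local build suffix after '+' is ignored, and missing
    components are padded with zeros.
    """
    core = version.split("+", 1)[0]
    nums = [int(p) for p in re.findall(r"\d+", core)[:3]]
    nums += [0] * (3 - len(nums))
    return tuple(nums)


def meets_minimum(installed: str, minimum: str = MIN_TORCH) -> bool:
    """Return True if `installed` is at least `minimum` (numeric comparison)."""
    return version_tuple(installed) >= version_tuple(minimum)


def check_environment() -> str:
    """Report whether the installed PyTorch satisfies the minimum version."""
    try:
        import torch  # imported lazily so the helpers above work without it
    except ImportError:
        return "PyTorch is not installed in this environment"
    status = "OK" if meets_minimum(torch.__version__) else "too old"
    return (
        f"PyTorch {torch.__version__}: {status} "
        f"(CUDA available: {torch.cuda.is_available()})"
    )


if __name__ == "__main__":
    print(check_environment())
```

Saving this as a script (the file name is up to you) and running it inside the activated `wespeaker` environment prints a one-line report of the installed version and whether CUDA is visible.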

The repository implements the following paper

@inproceedings{metro,
  author={Ladislav Mošner and Romain Serizel and Lukáš Burget and Oldřich Plchot and Emmanuel Vincent and Junyi Peng and Jan Černocký},
  title={{Multi-Channel Extension of Pre-trained Models for Speaker Verification}},
  year=2024,
  booktitle={Proc. Interspeech}
}

