GitHub - MusinguziDenis/Luganda-ASR

This repo contains code for fine-tuning wav2vec models on datasets from Mozilla Common Voice.

The fine-tuning script can be run on Google Colab.

The model is a Wav2Vec2-BERT (W2V2-BERT) model with a word error rate (WER) of 19.33.
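
For readers unfamiliar with the metric, WER is commonly defined as the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the model output, divided by the number of reference words. The sketch below is a minimal illustration of that definition, not the evaluation code used in this repo; the function name `word_error_rate` and the sample sentences are made up for the example. If the reported 19.33 is a percentage (an assumption, since the README does not say), it would correspond to about 19 word errors per 100 reference words.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1] / len(ref)


# Illustrative placeholder sentences (not taken from the dataset):
# one substitution plus one deletion over four reference words.
score = word_error_rate("a b c d", "a x c")
print(f"WER: {score:.2f}")
```

Libraries such as `jiwer` provide the same metric with text normalization options, which is usually preferable for real evaluations.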

You can download the model from Hugging Face and use it directly to reproduce these results.
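
As a rough sketch of what using the model directly could look like, the snippet below uses the `pipeline` helper from the `transformers` library. The README does not give the model's Hugging Face identifier, so `MODEL_ID` is a hypothetical placeholder that must be replaced with the real one, and `transcribe` is a helper name made up for this example. It assumes `transformers`, a PyTorch backend, and `ffmpeg` (for decoding audio files) are installed.

```python
# Hypothetical placeholder: replace with the actual model ID on Hugging Face.
MODEL_ID = "your-username/your-luganda-asr-model"


def transcribe(audio_path: str, model_id: str = MODEL_ID) -> str:
    """Transcribe a single audio file with a speech recognition pipeline."""
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import pipeline

    # Downloads the model and processor on first use, then caches them locally.
    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)
    return result["text"]


if __name__ == "__main__":
    # "sample.wav" is an assumed local file name, not a file from this repo.
    print(transcribe("sample.wav"))
```

Reproducing the reported WER would additionally require the same test split and text normalization used during evaluation, which the README does not specify.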

.ipynb_checkpoints
img
wav2vec/notebook
.gitignore
Data-Exploration.ipynb
Fine_Tune_W2V2_BERT_on_CV7_Luganda.ipynb
README.md
finetune_wav2vec.py
finetuning_wav2vec.json
finetuning_wav2vec.sh
luganda_vocab.json
requirements.txt
vocab.json
