Skip to content

Source code for "Pitch-aware generative pretraining improves multi-pitch estimation with scarce data" (MMASIA 2024)

License

Notifications You must be signed in to change notification settings

marypilataki/padac-mmasia24

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

padac-mmasia24

Source code for "Pitch-aware generative pretraining improves multi-pitch estimation with scarce data" (MMASIA 2024)

1. Preparation

2. Stage 1: pretraining

  • Replace dataset paths in conf/padac/pitch_cond_padac.yml with the paths of the dataset you would like to perform pretraining on.
  • Run the following command to start training. Replace ./runs with the path to the folder where you would like the model checkpoints to be saved.
python -m scripts.train_padac --args.load conf/padac/conf_padac.yml --save_path ./runs

3. Stage 2: shallow transcriber training

  • After freezing PA-DAC, extract and save latent space embeddings using latent_space = self.encoder(audio_data). Refer to the script scripts/extract_features.py for an example.
  • Prepare a config file similar to conf/transcriber.json specifying the paths to the extracted features and ground truth.
  • To start training, run the following command:
python -m scripts.train_transcriber --config_file ./conf/transcriber.json

About

Source code for "Pitch-aware generative pretraining improves multi-pitch estimation with scarce data" (MMASIA 2024)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages