Environmental noise and data sparsity present major challenges within the field of bioacoustics. This paper explores noise reduction (audio enhancement) techniques in the context of extremely sparse vocalisations (<1% occurrence rates) of invasive mammalian and marsupial species and the clear implications for other bioacoustics applications which face similar challenges. We compare relevant noise reduction techniques and recommend a spectral subtraction approach that outperforms alternative approaches in terms of stationary noise reduction, efficiency, and data requirements. We identify how the contributions of this work can be applied within the broader context of bioacoustics. We also explore the current benefits and limitations of state-of-the-art deep audio enhancement approaches within the context of bioacoustics applications.
pip install -r requirements.txt
Repository accompanying paper (Automatic Noise Reduction of Extremely Sparse Vocalisations for Bioacoustic Monitoring). Includes the New Zealand (NZ) Invasive Predator test dataset /audio/predator
. Find NZ Native Bird dataset at: https://github.com/smarsland/AviaNZ or /audio/bird
Full results (.wav) can be found at /results
, this includes Wavelet Packet Decomposition (WPD), spectral subtraction, CMGAN (spectral) and CMGAN (additive) for both datasets.
SnNR, Success Ratio (SR) and PSNR metric can be generated using evaluate.py
. Select results to evaluate from /results
and the corresponding dataset (.json), either Bird_dataset.json
or Predator_dataset.json
Find feature spacing test audio at /feature_spacing
. This is evaluated using manual_evaluation.py
@article{McEwen2023,
title = {Automatic noise reduction of extremely sparse vocalisations for bioacoustic monitoring},
journal = {Ecological Informatics},
pages = {102280},
year = {2023},
issn = {1574-9541},
doi = {https://doi.org/10.1016/j.ecoinf.2023.102280},
author = {Ben McEwen and Kaspar Soltero and Stefanie Gutschmidt and Andrew Bainbridge-Smith and James Atlas and Richard Green},
keywords = {Audio enhancement, Bioacoustics, Noise reduction, Perceptual quality, Signal processing},
}
This research was made possible through Capability Development funding through Predator Free 2050 Limited (https://pf2050.co.nz/). AviaNZ (https://github.com/smarsland/AviaNZ) for use of their WPD method and NZ native bird dataset. Tim Sainbury (https://github.com/timsainb/noisereduce) for use of the noisereduce package (spectral subtraction) Cao et al. (https://github.com/ruizhecao96/CMGAN) for access to CMGAN.