End-To-End Label Uncertainty Modeling for Speech Emotion Recognition (SER) Using Bayesian Neural Networks
This repository contains code for the papers
Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann, "End-To-End Label Uncertainty Modeling for Speech-based Arousal Recognition Using Bayesian Neural Networks", Interspeech, Incheon, Korea, Sep. 2022. [arxiv]
Navin Raj Prabhu, Nale Lehmann-Willenbrock, Timo Gerkmann, "Label Uncertainty Modeling and Prediction for Speech Emotion Recognition using t-Distributions", Affective Computing and Intelligent Interaction (ACII), Nara, Japan, Oct. 2022. [arxiv]
Navin Raj Prabhu, Nale Lehmann-Willenbrock, Timo Gerkmann, "End-to-End Label Uncertainty Modeling in Speech Emotion Recognition using Bayesian Neural Networks and Label Distribution Learning", IEEE Transactions on Affective Computing (TAFFC), May 2023. [arxiv]
Three variants of model and label uncertainty models for SER, introduced by the above papers, is available in this repository.
MU
ModelVariant.model_uncertainty
MU+LU
ModelVariant.label_uncertainty
t-LU
ModelVariant.tstud_label_uncertainty
The usage of these uncertainty models is demonstrated and available in the file unit_test.py.