Skip to content

sp-uhh/label-uncertainty-ser

Repository files navigation

End-To-End Label Uncertainty Modeling for Speech Emotion Recognition (SER) Using Bayesian Neural Networks

This repository contains code for the papers

Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann, "End-To-End Label Uncertainty Modeling for Speech-based Arousal Recognition Using Bayesian Neural Networks", Interspeech, Incheon, Korea, Sep. 2022. [arxiv]

Navin Raj Prabhu, Nale Lehmann-Willenbrock, Timo Gerkmann, "Label Uncertainty Modeling and Prediction for Speech Emotion Recognition using t-Distributions", Affective Computing and Intelligent Interaction (ACII), Nara, Japan, Oct. 2022. [arxiv]

Navin Raj Prabhu, Nale Lehmann-Willenbrock, Timo Gerkmann, "End-to-End Label Uncertainty Modeling in Speech Emotion Recognition using Bayesian Neural Networks and Label Distribution Learning", IEEE Transactions on Affective Computing (TAFFC), May 2023. [arxiv]

Model Variants

Three variants of model and label uncertainty models for SER, introduced by the above papers, is available in this repository.

Model Uncertainty and Label Uncertainty models

MU

ModelVariant.model_uncertainty 

MU+LU

ModelVariant.label_uncertainty 

alt text

t-distribution Label Uncertainty model

t-LU

ModelVariant.tstud_label_uncertainty 

alt text

Usage

The usage of these uncertainty models is demonstrated and available in the file unit_test.py.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages