audio file
03-01-01-01-01-01-01.wav
RAVDESS Dataset
https://smartlaboratory.org/ravdess/
emotion label
Emotion: neutral
Confidence: 0.99993193
"surprised", "neutral", "calm", "happy",
"sad", "angry", "fearful", "disgust"
This model requires additional module.
pip3 install librosa
$ python3 transformer-cnn-emotion-recognition.py -i input.wav
PyTorch 1.6.0
ONNX opset = 11