LSSED: a large-scale dataset and benchmark for speech emotion recognition.
Weiquan Fan, Xiangmin Xu, Xiaofen Xing, Weidong Chen, Dongyan Huang
Published in: CoRR (2021)
Keyphrases
- speech emotion recognition
- real world
- small scale
- real life
- comparative analysis
- benchmark datasets
- web scale
- database
- million images
- data sets
- computer vision
- image processing
- case study
- video sequences
- object recognition
- event detection
- artificial intelligence
- neural network
- synthetic datasets
- trecvid multimedia event detection