Multi-modal feature fusion based on multi-layers LSTM for video emotion recognition.
Weizhi Nie, Yan Yan, Dan Song, Kun Wang
Published in: Multim. Tools Appl. (2021)
Keyphrases
- multi modal
- emotion recognition
- audio visual
- feature fusion
- video search
- audio features
- feature extraction
- facial expressions
- video sequences
- video data
- multiple features
- multimedia
- data sets
- video frames
- multiple modalities
- image annotation
- video content
- human computer interaction
- video retrieval
- information fusion
- visual data
- high dimensional
- neural network
- text mining
- object recognition