A multimodal fusion-based deep learning framework combined with keyframe extraction and spatial and channel attention for group emotion recognition from videos.

Shubao Qi, Baolin Liu
Published in: Pattern Anal. Appl. (2023)
Keyphrases
  • deep learning
  • emotion recognition
  • audio visual
  • multimodal fusion
  • human computer interaction
  • feature space
  • multimedia
  • training set
  • information extraction