A multimodal fusion-based deep learning framework combined with keyframe extraction and spatial and channel attention for group emotion recognition from videos.

Published in: Pattern Anal. Appl. (2023)

Keyphrases