Reading Between the Frames: Multi-Modal Depression Detection in Videos from Non-Verbal Cues.
David Gimeno-GómezAna-Maria BucurAdrian CosmaCarlos David Martínez-HinarejosPaolo RossoPublished in: CoRR (2024)
Keyphrases
- multi modal
- endoscopic video
- video frames
- video scene
- video search
- audio visual
- key frames
- semantic concepts
- multi modality
- cross modal
- fusing multiple
- video data
- image annotation
- multiple modalities
- video sequences
- moving objects
- video clips
- video segments
- motion features
- video analysis
- high dimensional
- single modality
- video content
- event detection