Domain Generalization through Audio-Visual Relative Norm Alignment in First Person Action Recognition.
Mirco Planamente, Chiara Plizzari, Emanuele Alberti, Barbara Caputo
Published in: WACV (2022)
Keyphrases
- action recognition
- audio visual
- human actions
- multi modal
- visual data
- bag of words
- activity recognition
- visual information
- computer vision
- action classification
- multi stream
- multimedia
- recognition of human actions
- domain knowledge
- recognizing human actions
- action recognition in videos
- contextual information
- object recognition
- recognizing actions
- data points
- high level