Audio-Visual Glance Network for Efficient Video Recognition.
Muhammad Adi Nugroho, Sangmin Woo, Sumin Lee, Changick Kim
Published in: ICCV (2023)
Keyphrases
- audio visual
- visual data
- video summarization
- multi modal
- multimedia
- visual information
- meeting room
- video data
- video sequences
- action recognition
- object recognition
- data sets
- multi stream
- video streams
- pattern recognition
- audio features
- activity recognition
- temporal context
- audio visual content
- gait recognition
- video frames
- contextual information
- text mining
- web pages