Audio-Visual Glance Network for Efficient Video Recognition.
Muhammad Adi Nugroho, Sangmin Woo, Sumin Lee, Changick Kim
Published in: CoRR (2023)
Keyphrases
- audio visual
- video summarization
- multimedia
- multi modal
- visual data
- audio visual content
- visual information
- audio features
- meeting room
- object recognition
- video data
- multimodal fusion
- video streams
- pattern recognition
- feature extraction
- human activities
- video sequences
- data sets
- dimensionality reduction
- audio visual speech recognition
- video retrieval
- space time
- high dimensional data