Audio-Visual Saliency for Omnidirectional Videos.
Yuxin ZhuXilei ZhuHuiyu DuanJie LiKaiwei ZhangYucheng ZhuLi ChenXiongkuo MinGuangtao ZhaiPublished in: ICIG (5) (2023)
Keyphrases
- audio visual
- video summarization
- sports video
- multi modal
- visual data
- audio features
- visual information
- multi stream
- video sequences
- person authentication
- vision system
- temporal context
- video content
- video frames
- video data
- multimodal fusion
- audio visual speech recognition
- multimedia
- human activities
- data sets
- low level
- feature vectors
- feature selection