Audio-Visual Event Localization in Unconstrained Videos.
Yapeng TianJing ShiBochen LiZhiyao DuanChenliang XuPublished in: ECCV (2) (2018)
Keyphrases
- audio visual
- video summarization
- video scene
- multi modal
- visual data
- sports video
- audio features
- visual information
- event detection
- multimedia
- temporal context
- video sequences
- multimodal fusion
- audio visual speech recognition
- human activities
- person authentication
- video analysis
- news stories
- video frames
- video content
- key frames
- contextual information
- human actions
- multimedia data
- high level
- e learning