Saliency-maximized audio visualization and efficient audio-visual browsing for faster-than-real-time human acoustic event detection.
Kai-Hsiang LinXiaodan ZhuangCamille GoudeseuneSarah KingMark Hasegawa-JohnsonThomas S. HuangPublished in: ACM Trans. Appl. Percept. (2013)
Keyphrases
- audio visual
- event detection
- video summarization
- sports video
- multi modal
- multimedia
- temporal segmentation
- surveillance videos
- visual data
- emotion recognition
- visual information
- multi stream
- video analysis
- event recognition
- audio features
- temporal context
- audio visual speech recognition
- soccer video
- data analysis
- machine learning
- human body
- video data
- sound source