Deep-net fusion to classify shots in concert videos.
Wen-Li WeiJen-Chun LinTyng-Luh LiuYi-Hsuan YangHsin-Min WangHsiao-Rong TyanHong-Yuan Mark LiaoPublished in: ICASSP (2017)
Keyphrases
- video clips
- video data
- key frames
- video sequences
- news video
- video shots
- video frames
- tv series
- video database
- multi modal fusion
- video analysis
- video summarization
- data fusion
- tv news
- video streams
- video scene
- video summaries
- video content
- automatic classification
- web videos
- visual content
- information fusion
- temporal segmentation
- visual features
- video indexing
- video classification
- video material
- audio features
- image sequences
- image classification
- event recognition
- video segments
- fusion method
- dynamic scenes
- fusion methods
- video segmentation
- person identification
- multimedia
- multi sensor
- image fusion
- human actions
- feature vectors
- temporal video segmentation
- human activities