Multimodal Frame-Scoring Transformer for Video Summarization.
Jeiyoon Park, Kiho Kwoun, Chanhee Lee, Heuiseok Lim
Published in: CoRR (2022)
Keyphrases
- visual content
- video summarization
- key frames
- video retrieval
- audio visual
- video content
- video frames
- video summaries
- video data
- visual information
- video sequences
- low level features
- video browsing
- multi modal
- visual data
- fault diagnosis
- fuzzy logic
- surveillance videos
- multimedia
- object tracking
- object recognition
- sports video