Multimodal Video Summarization via Time-Aware Transformers.
Xindi Shang
Zehuan Yuan
Anran Wang
Changhu Wang
Published in:
ACM Multimedia (2021)
Keyphrases
video summarization
audio-visual
multimodal
video browsing
video content
video summaries
visual information
video data
visual data
event detection
video sequences
multimedia
surveillance videos
low level features
video retrieval
video frames
face recognition
three dimensional