Bi-Directional Self-Attention with Relative Positional Encoding for Video Summarization.
Jingxu LinSheng-hua ZhongPublished in: ICTAI (2020)
Keyphrases
- bi directional
- video summarization
- audio visual
- event detection
- video browsing
- video content
- video summaries
- video data
- associative memory
- key frames
- video sequences
- surveillance videos
- video frames
- video retrieval
- feature space
- neural network
- search engine
- object recognition
- natural language processing
- visual attention
- image data
- contextual information
- multi modal