Capturing Temporal Structures for Video Captioning by Spatio-temporal Contexts and Channel Attention Mechanism.
Dashan Guo, Wei Li, Xiangzhong Fang
Published in: Neural Process. Lett. (2017)
Keyphrases
- spatio temporal
- attention mechanism
- spatial and temporal
- space time
- spatial temporal
- video representation
- human actions
- spatio temporally
- video data
- multimedia
- visual attention
- video sequences
- video frames
- real time
- visual attention model
- video surveillance
- video content
- video streams
- moving objects
- image sequences
- video retrieval
- video analysis
- video database
- video summarization
- image segmentation
- low level
- key frames
- biologically inspired
- software engineering
- action recognition