Variational Stacked Local Attention Networks for Diverse Video Captioning.
Tonmoay DebAkib SadmaneeKishor Kumar BhaumikAmin Ahsan AliM. Ashraful AminA. K. M. Mahbubur RahmanPublished in: WACV (2022)
Keyphrases
- multimedia
- video streams
- video sequences
- video content
- video data
- video clips
- wide variety
- complex networks
- image segmentation
- real time video
- video frames
- video analysis
- digital video
- key frames
- real time
- social networks
- space time
- high bandwidth
- heterogeneous networks
- video retrieval
- real world
- network size
- methods in computer vision
- video images
- video processing
- network analysis
- video surveillance
- computer networks
- temporal information
- network structure
- object detection
- motion estimation
- spatio temporal