Multi-sentence video captioning using spatial saliency of video frames and content-oriented beam search algorithm.
Masoomeh NabatiAlireza BehradPublished in: Expert Syst. Appl. (2023)
Keyphrases
- video frames
- spatial and temporal
- saliency map
- search algorithm
- video footage
- video data
- video sequences
- temporal redundancy
- video segmentation
- video segments
- video streams
- video content
- inter frame
- successive frames
- visual saliency
- video clips
- temporal coherence
- key frames
- natural scenes
- input video
- video images
- foreground objects
- single frame
- spatio temporal
- multimedia
- video dataset
- video database
- static images
- camera movement
- video summarization
- monocular video sequences
- video shots
- object segmentation
- space time
- motion estimation
- multi frame
- video compression
- video analysis
- news video
- video retrieval
- human motion
- video recordings
- multi view
- object detection
- moving objects
- metadata