Let's Think Frame by Frame: Evaluating Video Chain of Thought with Video Infilling and Prediction.
Vaishnavi HimakunthalaAndy OuyangDaniel RoseRyan HeAlex MeiYujie LuChinmay SonarMichael SaxonWilliam Yang WangPublished in: CoRR (2023)
Keyphrases
- video frames
- key frames
- video sequences
- input video
- temporal coherence
- successive frames
- single frame
- video data
- multimedia
- video content
- video streams
- video segmentation
- reference frame
- image frames
- dynamic scenes
- temporal correlation
- video signals
- video objects
- video images
- video clips
- video database
- frame rate
- real time video
- space time
- real time
- temporal continuity
- adjacent frames
- compressed video
- consecutive frames
- digital video
- spatial and temporal
- video processing
- inter frame
- video summarization
- video analysis
- prediction accuracy
- computer vision
- stereoscopic video
- neural network