Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation.
Tsu-Jui FuLicheng YuNing ZhangCheng-Yang FuJong-Chyi SuWilliam Yang WangSean BellPublished in: CVPR (2023)
Keyphrases
- multimedia
- video data
- real time
- video sequences
- video analysis
- video content
- video streams
- natural language descriptions
- video clips
- real time video
- key frames
- video frames
- space time
- multimedia data
- closed captions
- video search
- video database
- video processing
- text mining
- video segments
- video images
- text information
- text detection
- natural language
- online video
- video collections
- information retrieval
- database