Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning.
Ligong HanJian RenHsin-Ying LeeFrancesco BarbieriKyle OlszewskiShervin MinaeeDimitris N. MetaxasSergey TulyakovPublished in: CoRR (2022)
Keyphrases
- multimedia
- video sequences
- video data
- video streams
- video content
- video analysis
- real time
- temporal information
- multi modal
- space time
- video retrieval
- video frames
- program synthesis
- spatio temporal
- story segmentation
- video database
- video clips
- multimodal information
- spatial and temporal
- online video
- multiple modalities
- key frames
- multimodal interaction
- real time video
- video processing
- quality metrics
- video segmentation
- multimedia data
- visual information
- hidden markov models
- moving objects
- data sets