A Benchmark for Controllable Text -Image-to-Video Generation.
Yaosi HuChong LuoZhenzhong ChenPublished in: IEEE Trans. Multim. (2024)
Keyphrases
- image data
- single image
- image content
- image analysis
- input image
- image representation
- multiscale
- image retrieval
- image features
- web images
- multimedia
- video images
- static images
- multimedia data
- image classification
- image segmentation
- textual descriptions
- key frames
- image collections
- multimedia documents
- text generation
- semantic labels
- text detection
- information retrieval
- image frames
- pixel values
- text mining
- edge detection
- low level
- visual cues
- visual data
- video search
- video content
- text information
- image regions
- feature points