Turning Text and Imagery into Captivating Visual Video.
Mingming WangElijah MillerPublished in: CoRR (2024)
Keyphrases
- video search
- news video
- content based video retrieval
- visual data
- visual cues
- video content
- visual information
- video database
- natural language descriptions
- visual analysis
- video retrieval
- video data
- text detection
- video sequences
- visual features
- video segments
- semantic labels
- text mining
- image data
- multimedia
- video shots
- information retrieval
- multimedia data
- video streams
- key frames
- video frames
- web images
- video collections
- video clips
- real time
- multi modal
- computer vision
- event detection
- text information
- textual descriptions
- visual motion
- keywords
- street view
- space time
- temporal information
- closed captions
- concept detectors
- wide area motion imagery
- visual saliency
- multimedia documents
- semantic content
- video analysis
- content based retrieval
- information extraction
- high resolution
- high level
- search engine