VCoME: Verbal Video Composition with Multimodal Editing Effects.
Weibo GongXiaojie JinXin LiDongliang HeXinglong WuPublished in: CoRR (2024)
Keyphrases
- multimedia
- video content
- video images
- video sequences
- video data
- video streams
- real time
- multimodal information
- story segmentation
- video database
- video analysis
- video clips
- multimedia data
- space time
- digital video
- multiple modalities
- multimodal interaction
- event recognition
- image processing
- video frames
- multi modal
- spatial and temporal
- medical images
- visual data
- low level
- image quality
- web service composition
- video processing
- video surveillance
- image editing
- real time video
- semantic web
- computer vision
- neural network