Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling.
Haogeng Liu
Qihang Fan
Tingkai Liu
Linjie Yang
Yunzhe Tao
Huaibo Huang
Ran He
Hongxia Yang
Published in:
CoRR (2023)
Keyphrases
cross-modal
multi-modal
visual data
video sequences
multimedia
multimedia retrieval
semantic concepts
video data
video content
video streams
multimedia databases
video analysis
video frames
visual similarity
multimedia data
key frames
image data
image retrieval
event detection
video retrieval
feature vectors