Login / Signup
Caption Anything: Interactive Image Description with Diverse Multimodal Controls.
Teng Wang
Jinrui Zhang
Junjie Fei
Hao Zheng
Yunlong Tang
Zhe Li
Mingqi Gao
Shanshan Zhao
Published in:
CoRR (2023)
Keyphrases
</>
image description
image understanding
data modeling
image representation
feature detection
databases
low level features
image retrieval
visual features
multiscale
multimedia
video sequences
object recognition
multi class
co occurrence
distributed systems
feature extraction
metadata
data mining
database