Login / Signup
Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning.
Alex Jinpeng Wang
Linjie Li
Yiqi Lin
Min Li
Lijuan Wang
Mike Zheng Shou
Published in:
CoRR (2024)
Keyphrases
</>
multi modal
auto annotation
video search
cross modal
multi modality
multiple modalities
information retrieval
high dimensional
active learning
audio visual
keywords
low level
information theoretic
text retrieval
automatic image annotation