Login / Signup
IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension.
Xuyang Liu
Ting Liu
Siteng Huang
Yue Hu
Quanjun Yin
Donglin Wang
Honggang Chen
Published in:
CoRR (2024)
Keyphrases
</>
multi modal
memory efficient
cross modal
external memory
multi modality
audio visual
high dimensional
image annotation
semantic concepts
video search
multiple sequence alignment
iterative deepening
machine learning
data structure
video sequences
integral image