RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension.
Qiang Zhou
Chaohui Yu
Shaofeng Zhang
Sitong Wu
Zhibing Wang
Fan Wang
Published in:
CoRR (2023)
Keyphrases
multi modal
multi modality
image annotation
high dimensional
audio visual
state space
computer vision
semantic concepts