Sign in

LEGO: Language Enhanced Multi-modal Grounding Model.

Zhaowei LiQi XuDong ZhangHang SongYiqing CaiQi QiRan ZhouJunting PanZefeng LiVan Tu VuZhida HuangTao Wang
Published in: CoRR (2024)
Keyphrases
  • multi modal
  • multimedia
  • multi modality
  • similarity measure
  • programming language
  • image annotation
  • imaging modalities