Login / Signup

Enhancing Multimodal Large Language Models with Multi-instance Visual Prompt Generator for Visual Representation Enrichment.

Wenliang ZhongWenyi WuQi LiRobert A. BartonBoxin DuShioulin SamKarim BouyarmaneIsmail B. TutarJunzhou Huang
Published in: CoRR (2024)
Keyphrases