Login / Signup

Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models.

Songtao JiangYan ZhangChenyi ZhouYeying JinYang FengJian WuZuozhu Liu
Published in: CoRR (2024)
Keyphrases