Sign in

ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling.

Siming YanMin BaiWeifeng ChenXiong ZhouQixing HuangLi Erran Li
Published in: CoRR (2024)
Keyphrases