HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models.

Runhui Huang, Xinpeng Ding, Chunwei Wang, Jianhua Han, Yulong Liu, Hengshuang Zhao, Hang Xu, Lu Hou, Wei Zhang, Xiaodan Liang
Published in: CoRR (2024)
Keyphrases