Login / Signup

LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding.

Yanzhe ZhangRuiyi ZhangJiuxiang GuYufan ZhouNedim LipkaDiyi YangTong Sun
Published in: CoRR (2023)
Keyphrases