TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding.

Bozhi Luan, Hao Feng, Hong Chen, Yonghui Wang, Wengang Zhou, Houqiang Li
Published in: CoRR (2024)
Keyphrases