Login / Signup

Lumos: Empowering Multimodal LLMs with Scene Text Recognition.

Ashish ShenoyYichao LuSrihari JayakumarDebojeet ChatterjeeMohsen MoslehpourPierce ChuangAbhay HarpaleVikas BhardwajDi XuShicong ZhaoLongfang ZhaoAnkit RamchandaniXin Luna DongAnuj Kumar
Published in: KDD (2024)
Keyphrases