Login / Signup
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming.
Jiaxin Zhang
Wentao Yang
Songxuan Lai
Zecheng Xie
Lianwen Jin
Published in:
CoRR (2024)
Keyphrases
</>
high level
visual information