Login / Signup

DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming.

Jiaxin ZhangWentao YangSongxuan LaiZecheng XieLianwen Jin
Published in: CoRR (2024)
Keyphrases
  • high level
  • visual information