VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding.

Ofir Abramovich Niv Nayman Sharon Fogel Inbal Lavi Ron Litman Shahar Tsiper Royee Tichauer Srikar Appalaraju Shai Mazor R. Manmatha

Published in: CoRR (2024)

Keyphrases