VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding.
Ofir AbramovichNiv NaymanSharon FogelInbal LaviRon LitmanShahar TsiperRoyee TichauerSrikar AppalarajuShai MazorR. ManmathaPublished in: CoRR (2024)
Keyphrases
- document understanding
- designing effective
- automatic summarization
- document clustering
- automatic text summarization
- multi document summarization
- optical character recognition
- computer vision
- information retrieval
- document summarization
- named entity recognition
- text summarization
- data mining
- text documents
- document collections
- information retrieval systems
- machine learning