Document cleanup using page frame detection.
Faisal ShafaitJoost van BeusekomDaniel KeysersThomas M. BreuelPublished in: Int. J. Document Anal. Recognit. (2008)
Keyphrases
- keywords
- information retrieval
- page layout analysis
- detection accuracy
- object detection
- document clustering
- frame rate
- automatic detection
- detection rate
- detection method
- detection algorithm
- information retrieval systems
- website
- document type
- false positives
- text documents
- web documents
- document classification
- www pages
- web pages
- document retrieval
- false alarms
- html documents
- page segmentation