Keyphrases
- data sets
- automatic detection
- information retrieval
- document images
- document retrieval
- information retrieval systems
- semi automatic
- web documents
- detection method
- false positives
- document classification
- detection algorithm
- document collections
- fully automatic
- document clustering
- structured documents
- semantic information
- database
- digital libraries
- event detection
- detection rate
- computer vision
- machine learning
- detection accuracy
- page segmentation