Unsupervised Classification of Structurally Similar Document Images.
Jayant KumarDavid S. DoermannPublished in: ICDAR (2013)
Keyphrases
- document images
- unsupervised classification
- document image analysis
- document analysis
- unsupervised learning
- page segmentation
- printed documents
- data clustering
- optical character recognition
- remote sensing data
- historical documents
- supervised classification
- hyperspectral images
- page layout
- word spotting
- scanned documents
- clustering ensemble
- handwritten documents
- text mining
- k means
- text retrieval
- pairwise
- pattern recognition
- reinforcement learning
- learning algorithm