Unsupervised Clustering with Smoothing for Detecting Paratext Boundaries in Scanned Documents.
Ana Lucic, Robin Burke, John Shanahan
Published in: JCDL (2019)
Keyphrases
- unsupervised clustering
- scanned documents
- noise removal
- supervised classification
- k-means
- image segmentation
- clustering method
- clustering algorithm
- text detection
- document images
- semi-supervised clustering
- spectral clustering
- document clustering
- fuzzy c-means
- multi-class
- digital libraries
- multiscale
- image processing
- image enhancement
- image restoration
- optical character recognition