Text Extraction from Color Documents - Clustering Approaches in Three and Four Dimensions.
T. Perroud, Karin Sobottka, Horst Bunke, Lawrence O. Hall
Published in: ICDAR (2001)
Keyphrases
- text extraction
- clustering approaches
- document clustering
- document classification
- text information
- complex background
- natural scenes
- clustering method
- information retrieval
- clustering algorithm
- text documents
- document collections
- text mining
- information retrieval systems
- color quantization
- text processing
- web documents
- text categorization
- subspace clustering
- textual information
- k-means
- data objects
- hierarchical clustering
- optical character recognition
- semi-supervised
- data sets
- text regions
- keywords
- feature extraction