A clustering-based approach to the separation of text strings from mixed text/graphics documents.
Shoujie He, Norihiro Abe
Published in: ICPR (1996)
Keyphrases
- text graphics
- document clustering
- text clustering
- text documents
- shape primitives
- engineering drawings
- information retrieval
- color palette
- clustering algorithm
- k means
- document collections
- text mining
- information retrieval systems
- web documents
- text collections
- free text
- document analysis
- text data
- image blocks
- digital documents
- string matching
- xml documents
- video sequences
- edit distance
- text categorization
- related documents
- text classification
- metadata