Exploring a new space of features for document classification: figure clustering.
Nawei ChenHagit ShatkayDorothea BlosteinPublished in: CASCON (2006)
Keyphrases
- document classification
- topic extraction
- text categorization
- k means
- text mining
- feature extraction
- data sets
- co occurrence
- feature space
- feature set
- natural language processing
- achieve high classification accuracy
- clustering method
- wordnet
- unsupervised learning
- knowledge representation
- prior knowledge
- feature vectors
- pairwise
- object recognition
- clustering algorithm
- learning algorithm