Frequent Itemset Based Hierarchical Document Clustering Using Wikipedia as External Knowledge.
Kiran G. V. R.Ravi ShankarVikram PudiPublished in: KES (2) (2010)
Keyphrases
- external knowledge
- hierarchical document clustering
- frequent itemsets
- k nearest neighbor
- text categorization
- knn
- itemsets
- frequent itemset mining
- domain knowledge
- association rules
- wordnet
- association rule mining
- frequent itemsets mining
- data structure
- mining algorithm
- data streams
- nearest neighbor
- frequent itemset discovery
- document clustering
- knowledge sources
- textual information
- text classification
- frequent patterns
- semantic features
- frequent closed itemsets
- knowledge structures
- web directories
- bag of words
- co occurrence
- search engine