Exploiting Distribution Skew for Scalable P2P Text Clustering.
Odysseas PapapetrouWolf SiberskiFabian LeitritzWolfgang NejdlPublished in: DBISP2P (2008)
Keyphrases
- text clustering
- document clustering
- text mining
- clustering algorithm
- hierarchical clustering
- text categorization
- k means
- text data
- background knowledge
- text documents
- metric learning
- text collections
- collaborative filtering
- wordnet
- self organizing maps
- pairwise
- user feedback
- natural language
- clustering quality
- knowledge base