Distributed Document Clustering Using Word-clusters.
Debzani Deb, Rafal A. Angryk
Published in: CIDM (2007)
Keyphrases
- document clustering
- document clusters
- clustering algorithm
- document representation
- clustering method
- document collections
- text mining
- text documents
- vector space model
- text clustering
- clustering quality
- cluster analysis
- document similarity
- tf-idf
- k-means
- similar documents
- n-gram
- co-occurrence
- pairwise constraints
- data clustering
- document corpus
- tolerance rough set
- data mining
- multimedia