Almost-Constant-Time Clustering of Arbitrary Corpus Subsets.
Craig SilversteinJan O. PedersenPublished in: SIGIR (1997)
Keyphrases
- clustering algorithm
- clustering method
- k means
- high dimensional data
- cluster analysis
- self organizing maps
- graph theoretic
- categorical data
- hierarchical clustering
- test set
- information retrieval
- information theoretic
- data clustering
- spectral clustering
- unsupervised learning
- similarity function
- data points
- pairwise
- multiword