Linear Time Clustering for High Dimensional Mixtures of Gaussian Clouds.
Dan Kushnir, Shirin Jalali, Iraj Saniee
Published in: CoRR (2017)
Keyphrases
- high dimensional
- high dimensionality
- clustering algorithm
- data points
- Gaussian mixture
- k-means
- mixtures of Gaussians
- high dimensional data
- data clustering
- hierarchical clustering
- low dimensional
- dimensionality reduction
- clustering method
- high dimensional data sets
- unsupervised learning
- categorical data
- nearest neighbor
- high dimensional datasets
- mixture distribution
- similarity search
- outlier detection
- worst case
- metric space
- document clustering
- model selection
- discrete data
- heavy tailed
- multi dimensional
- pairwise