Improving clustering efficiency by SimHash-based K-Means algorithm for big data analytics.
Jenq-Haur Wang, Jia-Zhi Lin
Published in: IEEE BigData (2016)
Keyphrases
- k means
- clustering method
- clustering algorithm
- data clustering
- learning algorithm
- expectation maximization
- hierarchical clustering
- clustering approaches
- spectral clustering
- document clustering
- rough k means
- fuzzy k means
- high efficiency
- cluster analysis
- computational complexity
- distance metric
- self organizing maps
- clustering quality
- clustering analysis
- machine learning
- user interface
- cluster structure
- clustering ensemble
- similarity measure
- image segmentation
- classical clustering algorithms
- kohonen self organizing maps