Efficient data distribution and results merging for parallel data clustering in mapreduce environment.
Abdelhak BousbaciNadjet KamelPublished in: Appl. Intell. (2018)
Keyphrases
- data clustering
- data distribution
- multi dimensional data
- parallel processing
- clustering algorithm
- data skew
- k means
- unsupervised learning
- data points
- data streams
- deterministic annealing
- clustering ensemble
- decision boundary
- cluster analysis
- spectral clustering
- concept drift
- index structure
- high dimensional data
- image data
- streaming data
- machine learning
- cloud computing
- databases