Clustering very large multi-dimensional datasets with MapReduce.
Robson Leonardo Ferreira Cordeiro, Caetano Traina Jr., Agma Juci Machado Traina, Julio César López-Hernández, U Kang, Christos Faloutsos
Published in: KDD (2011)
Keyphrases
- multi dimensional
- multi dimensional data
- clustering algorithm
- k means
- data mining tasks
- high dimensional data sets
- clustering method
- synthetic datasets
- synthetic and real datasets
- clustering approaches
- cloud computing
- data sets
- high dimensional datasets
- dimensional data
- self organizing maps
- data clustering
- spectral clustering
- cluster analysis
- unsupervised learning
- range queries
- index structure
- parallel processing
- fuzzy clustering
- distributed computing
- high dimensionality
- data management
- database