EMR: Scalable Clustering of Big HR Data using Evolutionary MapReduce.
Mahdi Bohlouli, Zhonghua He
Published in: WWW (Companion Volume) (2021)
Keyphrases
- data collection
- data points
- data sets
- data quality
- training data
- data analysis
- spectral clustering
- data sources
- original data
- synthetic data
- clustering method
- synthetic datasets
- big data
- data objects
- high dimensional data
- data warehouse
- data structure
- high quality
- database
- statistical analysis
- missing data
- attribute values
- data distribution
- image data
- high resolution
- feature space
- hierarchical clustering
- genetic algorithm
- multidimensional data
- similar objects