Clustering very large high-dimentional datasets based entropy with MapReduce.
Fangjun Luan, Jiadi Liu, Keyan Cao
Published in: ICNC-FSKD (2016)
Keyphrases
- information theoretic
- synthetic datasets
- high dimensional datasets
- synthetic and real datasets
- clustering method
- data mining tasks
- clustering approaches
- unsupervised learning
- data clustering
- clustering algorithm
- data partitioning
- k means
- hierarchical clustering
- mutual information
- anomaly detection
- synthetic and real life datasets
- data points
- database
- parallel processing
- self organizing maps
- high precision
- high dimensional data
- wide range
- similarity measure
- learning algorithm