Evolving controllably difficult datasets for clustering.
Cameron ShandRichard AllmendingerJulia HandlAndrew M. WebbJohn KeanePublished in: GECCO (2019)
Keyphrases
- clustering algorithm
- clustering approaches
- k means
- clustering method
- data mining tasks
- synthetic datasets
- hierarchical clustering
- data clustering
- synthetic and real datasets
- spectral clustering
- information theoretic
- categorical data
- cluster analysis
- distance metric
- outlier detection
- decision trees
- high dimensional datasets
- uci machine learning repository
- training dataset
- machine learning
- large scale data sets
- error prone
- fuzzy clustering
- similarity function
- benchmark datasets
- nearest neighbor
- real life
- training data
- website
- search engine
- genetic algorithm