Using Representative-Based Clustering for Nearest Neighbor Dataset Editing.
Christoph F. EickNidal M. ZeidatRicardo VilaltaPublished in: ICDM (2004)
Keyphrases
- nearest neighbor
- high dimensional data
- clustering algorithm
- k nearest neighbor
- data points
- nearest neighbor algorithm
- k nearest
- synthetic datasets
- high dimensional
- k means
- high dimensional datasets
- knn
- representative set
- data clustering
- nearest neighbor classification
- unsupervised learning
- benchmark datasets
- database
- nearest neighbor search
- high dimensionality
- categorical data
- information theoretic
- hierarchical clustering
- document clustering
- graph theoretic
- cluster analysis
- distance function
- low dimensional
- cluster structure
- training set
- machine learning
- data mining