On Saving Outliers for Better Clustering over Noisy Data.
Shaoxu Song, Fei Gao, Ruihong Huang, Yihan Wang
Published in: SIGMOD Conference (2021)
Keyphrases
- noisy data
- missing data
- high dimensionality
- noise tolerant
- k-means
- intrinsic dimensionality
- clustering algorithm
- clustering method
- categorical data
- cluster analysis
- spectral clustering
- noise free
- outlier detection
- input data
- data points
- high dimensional
- learning from noisy data
- missing values
- hierarchical clustering
- nearest neighbor
- training data
- image data