A Comparison of Categorical Attribute Data Clustering Methods.
Ville HautamäkiAntti PöllänenTomi KinnunenKong-Aik LeeHaizhou LiPasi FräntiPublished in: S+SSPR (2014)
Keyphrases
- data sets
- data collection
- attribute values
- database
- training data
- statistical analysis
- high quality
- experimental data
- data points
- categorical attributes
- neural network
- categorical data
- data quality
- noisy data
- synthetic data
- input data
- small number
- data sources
- data structure
- decision trees
- machine learning
- knowledge discovery
- end users
- application domains
- data distribution
- missing values
- raw data
- historical data
- metadata
- numeric attributes