Similarity Measures for Categorical Data: A Comparative Evaluation.
Shyam BoriahVarun ChandolaVipin KumarPublished in: SDM (2008)
Keyphrases
- comparative evaluation
- categorical data
- similarity measure
- cluster analysis
- parameter free
- numerical data
- numeric data
- attribute values
- distance measure
- distance based outlier detection
- databases
- feature vectors
- pairwise
- clustering method
- scoring methods
- correspondence analysis
- pattern recognition
- hierarchical latent class models
- text categorization
- unsupervised learning
- machine learning