Frequency-Based vs. Knowledge-Based Similarity Measures for Categorical Data.
Summaya MumtazMartin GiesePublished in: AAAI Spring Symposium: Combining Machine Learning with Knowledge Engineering (1) (2020)
Keyphrases
- categorical data
- similarity measure
- cluster analysis
- numerical data
- parameter free
- numeric data
- binary data
- distance based outlier detection
- data mining
- correspondence analysis
- feature vectors
- clustering method
- databases
- euclidean distance
- hierarchical latent class models
- multivariate time series
- dimensionality reduction
- text mining
- prior knowledge
- pairwise
- machine learning
- real world