Enhanced co-occurrence distances for categorical data in unsupervised learning.
Jia-Yi Feng, Mingchun Wang, Can Wang, Longbing Cao
Published in: ICMLC (2010)
Keyphrases
- co-occurrence
- categorical data
- unsupervised learning
- cluster analysis
- numerical data
- parameter-free
- supervised learning
- semantic similarity
- WordNet
- named entities
- object recognition
- hierarchical latent class models
- feature selection
- attribute values
- semantic relations
- real world
- distance measure
- dimensionality reduction
- machine learning
- model selection
- expectation maximization
- natural language processing
- semi-supervised
- knowledge discovery
- latent semantic analysis
- k-means
- occurrence frequency
- distance-based outlier detection
- data sets