Symbolic word clustering for medium-size corpora.
Benoit HabertElie NaulleauAdeline NazarenkoPublished in: COLING (1996)
Keyphrases
- medium size
- clustering algorithm
- clustering method
- text corpus
- hierarchical clustering
- k means
- unsupervised learning
- n gram
- high level
- statistical machine translation
- text corpora
- symbolic data analysis
- neural network
- parallel corpus
- symbolic representation
- spectral clustering
- data clustering
- document clustering
- fuzzy clustering
- categorical data
- cluster analysis
- self organizing maps
- high dimensional data
- anomaly detection
- data points