Word Clustering Using PLSA Enhanced with Long Distance Bigrams.
Nikoletta BassiouConstantine KotropoulosPublished in: ICPR (2010)
Keyphrases
- long distance
- n gram
- probabilistic latent semantic analysis
- co occurrence
- mutual exclusion
- clustering algorithm
- data clustering
- k means
- hierarchical clustering
- computer technology
- named entities
- visual features
- document clustering
- multi view clustering
- word segmentation
- orders of magnitude
- cluster analysis
- training corpus
- information theoretic
- word sense disambiguation
- fuzzy clustering
- word recognition
- clustering method
- language model
- upper layer