Algorithms for bigram and trigram word clustering.
Sven C. Martin, Jörg Liermann, Hermann Ney
Published in: EUROSPEECH (1995)
Keyphrases
- n-gram
- word segmentation
- clustering algorithm
- language model
- theoretical analysis
- data clustering
- hidden Markov models
- optimization problems
- clustering method
- computationally efficient
- data sets
- computational cost
- hierarchical clustering
- computational complexity
- data structure
- synthetic datasets
- text classification
- document clustering
- cluster analysis
- machine learning algorithms
- k-means
- information retrieval
- neural network