Algorithms for bigram and trigram word clustering.
Sven C. MartinJörg LiermannHermann NeyPublished in: Speech Commun. (1998)
Keyphrases
- data clustering
- orders of magnitude
- data structure
- n gram
- language model
- word segmentation
- computational complexity
- computational cost
- theoretical analysis
- computationally efficient
- text mining
- self organizing maps
- unsupervised learning
- data mining techniques
- optimization problems
- co occurrence
- learning algorithm
- worst case
- significant improvement
- evolutionary algorithm
- pairwise
- lower bound
- keywords
- decision trees
- clustering algorithm