Hybrid N-gram Probability Estimation in Morphologically Rich Languages.
Hyopil ShinHyun-Jo YouPublished in: PACLIC (2009)
Keyphrases
- n gram
- probability estimation
- language independent
- language specific
- text classification
- language model
- character n grams
- naive bayes
- decision trees
- variable length
- multi class classification
- language modeling
- part of speech
- word segmentation
- cross lingual
- roc curve
- hidden markov models
- pairwise
- training data
- web pages