Reduced n-gram Models for English and Chinese Corpora.
Le Quan HaPhilip HannaDarryl StewartFrancis Jack SmithPublished in: ACL (2006)
Keyphrases
- n gram
- language model
- word segmentation
- text classification
- character n grams
- language modelling
- statistical language modeling
- probabilistic model
- finite state transducers
- natural language processing
- variable length
- machine translation
- parallel corpus
- viterbi algorithm
- language independent
- natural language
- bag of words
- knowledge representation
- hidden markov models