Frequency Consolidation Among Word N-Grams - A Practical Procedure.
Andreas Buerki
Published in: Europhras (2017)
Keyphrases
- n-gram
- language model
- bag of words
- language independent
- document frequency
- word segmentation
- text classification
- variable length
- real world
- web documents
- part of speech
- language modeling
- co-occurrence
- Viterbi algorithm
- language modelling
- character n-grams
- word level
- databases
- data mining
- relevance ranking
- inside-outside algorithm
- question answering
- out-of-vocabulary
- statistical language modeling