n-gram Cache Performance in Statistical Extraction of Relevant Terms in Large Corpora.
Carlos GonçalvesJoaquim F. SilvaJosé C. CunhaPublished in: ICCS (2) (2019)
Keyphrases
- n gram
- language model
- variable length
- text classification
- language modeling
- language independent
- character n grams
- co occurrence
- language modelling
- bag of words
- viterbi algorithm
- statistical language modeling
- part of speech
- word segmentation
- query terms
- query processing
- out of vocabulary
- natural language processing
- data mining