A Generalized Language Model as the Combination of Skipped n-grams and Modified Kneser-Ney Smoothing.
Rene PickhardtThomas GottronMartin KörnerPaul Georg WagnerTill SpeicherSteffen StaabPublished in: CoRR (2014)
Keyphrases
- language model
- n gram
- smoothing methods
- language modeling
- statistical language modeling
- language models for information retrieval
- document retrieval
- probabilistic model
- language modelling
- speech recognition
- dirichlet prior
- bag of words
- retrieval model
- document length
- information retrieval
- query expansion
- language independent
- context sensitive
- language modeling framework
- mixture model
- test collection
- part of speech
- jelinek mercer
- translation model
- pseudo relevance feedback
- document ranking
- word segmentation
- document representation
- vector space model
- query terms
- ad hoc information retrieval
- out of vocabulary
- document collections
- data mining