The Effects of Corpus Size and Homogeneity on Language Model Quality.
Tony G. Rose
Published in: VLC (1997)
Keyphrases
- language model
- language modeling
- document level
- n gram
- document retrieval
- probabilistic model
- statistical machine translation
- query expansion
- information retrieval
- statistical language models
- multiword
- retrieval model
- ad hoc information retrieval
- test collection
- speech recognition
- context sensitive
- language modelling
- translation model
- mixture model
- smoothing methods
- language model for information retrieval
- relevance model
- document length
- pseudo relevance feedback
- dirichlet prior
- query terms
- statistical model