Producing Power-Law Distributions and Damping Word Frequencies with Two-Stage Language Models.
Sharon GoldwaterThomas L. GriffithsMark JohnsonPublished in: J. Mach. Learn. Res. (2011)
Keyphrases
- language model
- word frequencies
- power law distribution
- dependency structure
- language modeling
- n gram
- information retrieval
- probabilistic model
- document retrieval
- text corpus
- retrieval model
- scale free
- long tail
- query expansion
- test collection
- power law
- social networks
- translation model
- small world
- pseudo relevance feedback
- vector space model