The beta-binomial mixture model for word frequencies in documents with applications to information retrieval.
Stephen A. Lowe
Published in: EUROSPEECH (1999)
Keyphrases
- mixture model
- word frequencies
- information retrieval
- language model
- EM algorithm
- document collections
- information retrieval systems
- generative model
- model selection
- text corpus
- probabilistic model
- expectation maximization
- relevant documents
- language modeling
- document retrieval
- maximum likelihood
- unsupervised learning
- query expansion
- vector space model
- power law distribution
- word frequency
- retrieval systems
- retrieval model
- information extraction
- search engine
- computational linguistics
- topic modeling
- machine learning
- test collection
- question answering
- text mining
- query terms
- n-gram
- relevance feedback
- active learning
- k-means