Documents and queries as random variables: History and implications.
David BodoffSamuel Po-Shing WongPublished in: J. Assoc. Inf. Sci. Technol. (2006)
Keyphrases
- random variables
- user queries
- graphical models
- query terms
- retrieval systems
- probability distribution
- boolean queries
- relevant documents
- distributed information retrieval
- information retrieval
- information retrieval systems
- query language
- joint distribution
- inverted index
- conditional independence
- query processing
- bayesian networks
- stochastic optimization problems
- distribution function
- latent variables
- independent and identically distributed
- document classification
- document clustering
- web search engines
- web documents
- document collections
- random vectors
- probabilistic graphical models
- joint probability distribution
- statistically independent
- xml documents
- co occurrence
- normal distribution
- marginal distributions
- search history
- keywords
- conditional distribution
- conditional distributions
- text documents
- conditional probabilities
- search engine
- document retrieval
- information extraction
- failure rate