Using parsimonious language models on web data.
Rianne KapteinRongmei LiDjoerd HiemstraJaap KampsPublished in: SIGIR (2008)
Keyphrases
- web data
- language model
- web mining
- language modeling
- probabilistic model
- n gram
- document retrieval
- web usage mining
- retrieval model
- web pages
- query terms
- semi structured
- language modelling
- web documents
- information retrieval
- web content
- test collection
- query expansion
- query logs
- deep web
- document ranking
- vector space model
- smoothing methods
- data model
- link analysis
- document representation
- image retrieval
- link structure
- relevance model
- data mining
- statistical language models