Stochastic language model for analyzing document physical layout.
Tapas Kanungo, Song Mao
Published in: Document Recognition and Retrieval (2002)
Keyphrases
- language model
- physical layout
- document retrieval
- document ranking
- ad hoc information retrieval
- information retrieval
- document length
- document representation
- query terms
- word clouds
- language modeling
- vector space model
- language modeling approaches
- document level
- query specific
- n-gram
- probabilistic model
- speech recognition
- query expansion
- language modeling framework
- pseudo feedback
- language modelling
- test collection
- okapi bm
- term dependencies
- probabilistic retrieval models
- language model for information retrieval
- retrieval model
- context sensitive
- relevance model
- jelinek mercer
- smoothing methods
- retrieval systems
- retrieved documents
- translation model
- document collections
- multiword
- web documents
- relevant documents
- database
- information retrieval systems
- mixture model
- language models for information retrieval
- term frequency
- pseudo relevance feedback
- user queries
- term weighting
- retrieval effectiveness
- co-occurrence
- textual documents
- ad hoc retrieval