Comparing Language Model Vocabulary Coverage on Clinical Documents.
Bryan D. Steitz, Adam Wright
Published in: AMIA (2021)
Keyphrases
- language model
- document retrieval
- ad hoc information retrieval
- query terms
- information retrieval
- vector space model
- language modeling
- document ranking
- document representation
- language modeling approaches
- document level
- relevance model
- out of vocabulary
- query expansion
- retrieval model
- document length
- statistical language models
- n gram
- pseudo feedback
- word clouds
- test collection
- trec test collections
- probabilistic model
- speech recognition
- ir models
- relevant documents
- query specific
- probabilistic retrieval models
- term dependencies
- document collections
- smoothing methods
- keywords
- information retrieval systems
- retrieved documents
- multiword
- mixture model
- inter document similarities
- context sensitive
- expert finding
- language modeling framework
- document clustering
- retrieval effectiveness
- translation model
- expert search
- web documents
- trec collections
- free text
- document similarity
- pseudo relevance feedback
- latent semantic indexing
- vector space
- text documents
- tf idf
- document set
- generative model