LinkBERT: Pretraining Language Models with Document Links.
Michihiro YasunagaJure LeskovecPercy LiangPublished in: CoRR (2022)
Keyphrases
- language model
- document retrieval
- document ranking
- document length
- ad hoc information retrieval
- language modeling
- information retrieval
- query terms
- document representation
- vector space model
- language modeling approaches
- document level
- n gram
- passage retrieval
- probabilistic model
- speech recognition
- test collection
- query specific
- retrieval model
- relevance model
- word clouds
- smoothing methods
- query expansion
- relevant documents
- language models for information retrieval
- pseudo feedback
- language modelling
- term dependencies
- document collections
- language modeling framework
- pseudo relevance feedback
- statistical language models
- context sensitive
- information retrieval systems
- retrieval systems
- okapi bm
- language model for information retrieval
- retrieval effectiveness
- probabilistic retrieval models
- cross lingual
- link structure
- term frequency
- search engine
- document clustering
- tf idf
- translation model
- document similarity
- document structure
- average precision
- term weighting
- statistical language modeling
- user queries
- text classification
- keywords
- spoken term detection
- expert search