Structure-Aware Language Model Pretraining Improves Dense Retrieval on Structured Data.
Xinze Li, Zhenghao Liu, Chenyan Xiong, Shi Yu, Yu Gu, Zhiyuan Liu, Ge Yu
Published in: ACL (Findings) (2023)
Keyphrases
- structured data
- language model
- retrieval model
- document retrieval
- structured information
- ad hoc information retrieval
- test collection
- language modeling
- information retrieval
- query expansion
- semi-structured
- n-gram
- language models for information retrieval
- unstructured data
- query terms
- smoothing methods
- probabilistic model
- semi-structured data
- relevance model
- document length
- information extraction
- XML documents
- query-specific
- structured queries
- linked data
- keyword queries
- textual data
- retrieval effectiveness
- vector space model
- retrieval systems
- pseudo-relevance feedback
- tf-idf
- keyword search
- database
- web documents
- information retrieval systems
- inter-document similarities
- data sources
- Dirichlet prior
- data sets