CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models.
Yuanjie LyuZhiyu LiSimin NiuFeiyu XiongBo TangWenjin WangHao WuHuanyong LiuTong XuEnhong ChenPublished in: CoRR (2024)
Keyphrases
- language model
- retrieval model
- document retrieval
- query expansion
- test collection
- language modeling
- language models for information retrieval
- ad hoc information retrieval
- information retrieval
- cross language retrieval
- query terms
- statistical language models
- smoothing methods
- n gram
- document ranking
- document length
- probabilistic model
- relevance model
- speech recognition
- query specific
- language modelling
- document level
- passage retrieval
- text retrieval
- term dependencies
- pseudo relevance feedback
- out of vocabulary
- context sensitive
- ir models
- statistical language modeling
- word segmentation
- retrieval effectiveness
- vector space model
- pseudo feedback
- image retrieval
- retrieval systems
- information retrieval systems
- chinese english
- text summarization
- okapi bm
- image database
- retrieval process
- term frequency
- tf idf
- language modeling approaches
- ad hoc retrieval
- term weighting