MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models.
Mianxin LiuJinru DingJie XuWeiguo HuXiaoyang LiLifeng ZhuZhian BaiXiaoming ShiBenyou WangHaitao SongPengfei LiuXiaofan ZhangShanshan WangKang LiHaofen WangTong RuanXuanjing HuangXin SunShaoting ZhangPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- probabilistic model
- language modelling
- n gram
- document retrieval
- retrieval model
- word segmentation
- query expansion
- information retrieval
- speech recognition
- context sensitive
- statistical language models
- smoothing methods
- test collection
- language model for information retrieval
- ad hoc information retrieval
- pseudo relevance feedback
- term dependencies
- machine learning
- vector space model
- relevance feedback
- passage retrieval
- document length