MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation.
Zexue HeYu WangAn YanYao LiuEric Y. ChangAmilcare GentiliJulian J. McAuleyChun-Nan HsuPublished in: EMNLP (2023)
Keyphrases
- language model
- multi task
- multi domain
- language modeling
- multi task learning
- n gram
- probabilistic model
- smoothing methods
- cross domain
- multi class
- learning tasks
- test collection
- learning problems
- information retrieval
- domain specific
- metric learning
- gaussian processes
- feature selection
- data sets
- learning styles
- information gain
- model selection
- natural language
- learning algorithm