Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models.
Yang LiuPublished in: CoRR (2024)
Keyphrases
- language model
- evaluation measures
- ir models
- language modeling
- learning to rank
- document retrieval
- information retrieval
- n gram
- precision and recall
- speech recognition
- language modelling
- probabilistic model
- test collection
- retrieval systems
- retrieval model
- query expansion
- statistical language models
- smoothing methods
- query terms
- language models for information retrieval
- evaluation metrics
- benchmark datasets
- relevance model
- document ranking
- ranked list
- information extraction
- pseudo relevance feedback
- relevance judgments
- trec collections
- active learning
- decision trees