Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models.
Boxin WangChejian XuShuohang WangZhe GanYu ChengJianfeng GaoAhmed Hassan AwadallahBo LiPublished in: NeurIPS Datasets and Benchmarks (2021)
Keyphrases
- language model
- multi task
- language modeling
- multi task learning
- probabilistic model
- document retrieval
- statistical language models
- language modelling
- learning tasks
- n gram
- retrieval model
- information retrieval
- learning problems
- transfer learning
- test collection
- multi class
- document ranking
- multiple tasks
- relevance model
- smoothing methods
- xml retrieval
- machine learning
- document collections
- knowledge discovery
- language models for information retrieval