Evaluating NLP Models via Contrast Sets.
Matt GardnerYoav ArtziVictoria BasmovaJonathan BerantBen BoginSihao ChenPradeep DasigiDheeru DuaYanai ElazarAnanth GottumukkalaNitish GuptaHanna HajishirziGabriel IlharcoDaniel KhashabiKevin LinJiangming LiuNelson F. LiuPhoebe MulcaireQiang NingSameer SinghNoah A. SmithSanjay SubramanianReut TsarfatyEric WallaceAlly ZhangBen ZhouPublished in: CoRR (2020)