Pervasive Label Errors in Test Sets Destabilize Machine Learning Benchmarks

Curtis G. Northcutt, Anish Athalye, Jonas Mueller

Published in: NeurIPS Datasets and Benchmarks (2021)

Keyphrases