An Approach to Multiple Comparison Benchmark Evaluations that is Stable Under Manipulation of the Comparate Set.
Ali Ismail-FawazAngus DempsterChang Wei TanMatthieu HerrmannLynn MillerDaniel F. SchmidtStefano BerrettiJonathan WeberMaxime DevanneGermain ForestierGeoffrey I. WebbPublished in: CoRR (2023)