Login / Signup
BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance.
R. Thomas McCoy
Junghyun Min
Tal Linzen
Published in:
CoRR (2019)
Keyphrases
</>
test set
error rate
training set
test cases
probabilistic model
test data
model selection
computer vision
cross validation
random selection