BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance.

Published in: CoRR (2019)

Keyphrases