What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?

Published in: CoRR (2022)

Keyphrases