No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models.

Published in: NeurIPS (2023)

Keyphrases