Multi-armed bandits for online optimization of language model pre-training: the use case of dynamic masking.

Published in: CoRR (2022)

Keyphrases