Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking.

Published in: ACL (Findings) (2023)

Keyphrases