Sign in

Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking.

Iñigo UrteagaMoulay-Zaïdane DraïdiaTomer LancewickiShahram Khadivi
Published in: ACL (Findings) (2023)
Keyphrases