Scaling Studies for Efficient Parameter Search and Parallelism for Large Language Model Pre-training.

Michael Benington, Leo Phan, Chris Pierre Paul, Evan Shoemaker, Priyanka Ranade, Torstein Collett, Grant Hodgson Perez, Christopher Krieger
Published in: CoRR (2023)
Keyphrases