Login / Signup

Spike No More: Stabilizing the Pre-training of Large Language Models.

Sho TakaseShun KiyonoSosuke KobayashiJun Suzuki
Published in: CoRR (2023)
Keyphrases