Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck.

Nathan Godey, Éric de la Clergerie, Benoît Sagot
Published in: CoRR (2024)
Keyphrases