Login / Signup

The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit.

Lorenzo NociChuning LiMufan Bill LiBobby HeThomas HofmannChris MaddisonDaniel M. Roy
Published in: CoRR (2023)
Keyphrases