Login / Signup

On the Expressivity Role of LayerNorm in Transformers' Attention.

Shaked BrodyUri AlonEran Yahav
Published in: ACL (Findings) (2023)
Keyphrases