What can a Single Attention Layer Learn? A Study Through the Random Features Lens.

Hengyu Fu, Tianyu Guo, Yu Bai, Song Mei
Published in: CoRR (2023)