Simple linear attention language models balance the recall-throughput tradeoff.

Simran Arora, Sabri Eyuboglu, Michael Zhang, Aman Timalsina, Silas Alberti, Dylan Zinsley, James Zou, Atri Rudra, Christopher Ré
Published in: CoRR (2024)