In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization.
Ruiqi Zhang
Jingfeng Wu
Peter L. Bartlett
Published in:
CoRR (2024)
Keyphrases
learning algorithm
learning process
learning systems
neural network
reinforcement learning
online learning
learning phase
expert systems
supervised learning
post processing
learning problems
incremental learning
learning analytics
radial basis function network