Login / Signup

A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness.

Yuhao ZhangAws AlbarghouthiLoris D'Antoni
Published in: CoRR (2024)
Keyphrases
  • multi layer
  • fuzzy logic
  • nearest neighbor
  • application layer
  • data sets