Login / Signup
A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness.
Yuhao Zhang
Aws Albarghouthi
Loris D'Antoni
Published in:
CoRR (2024)
Keyphrases
</>
multi layer
fuzzy logic
nearest neighbor
application layer
data sets