Login / Signup

Do pretrained Transformers Really Learn In-context by Gradient Descent?

Lingfeng ShenAayush MishraDaniel Khashabi
Published in: CoRR (2023)
Keyphrases
  • contextual information
  • context aware
  • learning rules
  • databases
  • real world
  • genetic algorithm
  • multiscale
  • data structure
  • special case
  • conceptual model
  • context sensitive