Beyond Attention: Breaking the Limits of Transformer Context Length with Recurrent Memory.

Aydar BulatovYuri KuratovYermek KapushevMikhail Burtsev
Published in: AAAI (2024)
Keyphrases
  • real time
  • fuzzy logic
  • contextual information
  • power system
  • recurrent neural networks
  • context sensitive
  • databases
  • neural network
  • expert systems
  • high speed
  • fault diagnosis
  • feed forward
  • fixed length
  • computing power