Login / Signup

Predicting Attention Sparsity in Transformers.

Marcos V. TrevisoAntónio GóisPatrick FernandesErick Rocha FonsecaAndré F. T. Martins
Published in: SPNLP@ACL (2022)
Keyphrases
  • high dimensional
  • artificial intelligence
  • sparse representation
  • genetic algorithm
  • search engine
  • information systems
  • feature selection
  • similarity measure
  • focus of attention