Login / Signup
Predicting Attention Sparsity in Transformers.
Marcos V. Treviso
António Góis
Patrick Fernandes
Erick Rocha Fonseca
André F. T. Martins
Published in:
SPNLP@ACL (2022)
Keyphrases
</>
high dimensional
artificial intelligence
sparse representation
genetic algorithm
search engine
information systems
feature selection
similarity measure
focus of attention