Login / Signup
Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer.
Qingru Zhang
Dhananjay Ram
Cole Hawkins
Sheng Zha
Tuo Zhao
Published in:
EMNLP (Findings) (2023)
Keyphrases
</>
long range
short range
conditional random fields
long range correlations
prior knowledge
computer vision
similarity measure
video sequences