Login / Signup
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences.
Sun Ao
Weilin Zhao
Xu Han
Cheng Yang
Zhiyuan Liu
Chuan Shi
Maosong Sun
Shengnan Wang
Teng Su
Published in:
CoRR (2024)
Keyphrases
</>
long sequences
main contribution
distributed systems
decision trees
multi agent
cooperative
lightweight
data model
distributed data sources
computer networks
distributed environment
theoretical framework
real time
expert systems
computer vision
real world
databases