Login / Signup

Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens.

Zhanpeng ZengCole HawkinsMingyi HongAston ZhangNikolaos PappasVikas SinghShuai Zheng
Published in: CoRR (2023)
Keyphrases
  • line segments
  • data mining
  • decision making
  • real time
  • genetic algorithm
  • data analysis
  • information technology
  • multi objective
  • dynamic programming