Self-Selected Attention Span for Accelerating Large Language Model Inference.

Tian Jin, Wanzin Yazar, Zifei Xu, Sayeh Sharify, Xin Wang
Published in: CoRR (2024)