Sign in

Head-wise Shareable Attention for Large Language Models.

Zouying CaoYifei YangHai Zhao
Published in: CoRR (2024)
Keyphrases