Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration.
Zhongzhi Yu
Zheng Wang
Yonggan Fu
Huihong Shi
Khalid Shaikh
Yingyan Celine Lin
Published in:
CoRR (2024)
Keyphrases
language model
language modeling
n-gram
document retrieval
language modelling
retrieval model
decision trees
training set
query expansion
relevance model
statistical language models