KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches.
Jiayi YuanHongyi LiuShaochen ZhongYu-Neng ChuangSongchen LiGuanchu WangDuy LeHongye JinVipin ChaudharyZhaozhuo XuZirui LiuXia HuPublished in: CoRR (2024)