Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging.
Deyuan LiuZhanyue QinHairu WangZhao YangZecheng WangFangying RongQingbin LiuYanchao HaoXi ChenCunhang FanZhao LvZhiying TuDianhui ChuDianbo SuiPublished in: CoRR (2024)