EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting.
Zhongzhi YuZheng WangYuhan LiHaoran YouRuijie GaoXiaoya ZhouSreenidhi Reedy BommuYang Katie ZhaoYingyan Celine LinPublished in: CoRR (2024)