Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding.

Siqi Wang, Hailong Yang, Xuezhu Wang, Tongxuan Liu, Pengbo Wang, Xuning Liang, Kejie Ma, Tianyu Feng, Xin You, Yongjun Bao, Yi Liu, Zhongzhi Luan, Depei Qian
Published in: CoRR (2024)
Keyphrases