EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models.

Yunsheng Ni, Chuanjian Liu, Yehui Tang, Kai Han, Yunhe Wang
Published in: CoRR (2024)
Keyphrases