Login / Signup

Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding.

Jun ZhangJue WangHuan LiLidan ShouKe ChenGang ChenSharad Mehrotra
Published in: CoRR (2023)
Keyphrases