Recurrent Drafter for Fast Speculative Decoding in Large Language Models.

Aonan Zhang, Chong Wang, Yi Wang, Xuanyu Zhang, Yunfei Cheng
Published in: CoRR (2024)