Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement.

Wonseok Jeon, Mukul Gagrani, Raghavv Goel, Junyoung Park, Mingu Lee, Christopher Lott
Published in: CoRR (2024)