Accelerating LLM Inference with Staged Speculative Decoding.

Benjamin Spector, Chris Re
Published in: CoRR (2023)