Login / Signup

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding.

Zhuoming ChenAvner MayRuslan SvirschevskiYuhsun HuangMax RyabininZhihao JiaBeidi Chen
Published in: CoRR (2024)
Keyphrases