S2D: Sorted Speculative Decoding For More Efficient Deployment of Nested Large Language Models.

Parsa Kavehzadeh, Mohammadreza Pourreza, Mojtaba Valipour, Tinashu Zhu, Haoli Bai, Ali Ghodsi, Boxing Chen, Mehdi Rezagholizadeh
Published in: CoRR (2024)