Login / Signup

The Synergy of Speculative Decoding and Batching in Serving Large Language Models.

Qidong SuChristina GiannoulaGennady Pekhimenko
Published in: CoRR (2023)
Keyphrases