Login / Signup

Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters.

Euiin YiTaehyeon KimHongseok JeungDu-Seong ChangSe-Young Yun
Published in: CoRR (2024)
Keyphrases