Sign in

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding.

Yichao FuPeter BailisIon StoicaHao Zhang
Published in: CoRR (2024)
Keyphrases