Login / Signup
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding.
Yichao Fu
Peter Bailis
Ion Stoica
Hao Zhang
Published in:
CoRR (2024)
Keyphrases
</>
probabilistic inference
bayesian networks
bayesian inference
efficient inference algorithms
decoding algorithm
artificial intelligence
data structure
data sets
inference engine
graphical models
efficient learning
sequential data
belief networks
lower bound
information systems
machine learning
neural network