Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement.
Wonseok JeonMukul GagraniRaghavv GoelJunyoung ParkMingu LeeChristopher LottPublished in: CoRR (2024)
Keyphrases
- markov chain monte carlo
- gibbs sampler
- monte carlo
- metropolis hastings
- decoding algorithm
- bayesian networks
- probabilistic inference
- neural network
- bayesian inference
- inference process
- sample size
- belief networks
- markov chain
- sampling algorithm
- gibbs sampling
- recursive algorithm
- probabilistic model
- recursive functions
- joint detection