Accelerating Production LLMs with Combined Token/Embedding Speculators.
Davis Wertheimer
Joshua Rosenkranz
Thomas P. Parnell
Sahil Suneja
Pavithra Ranganathan
Raghu K. Ganti
Mudhakar Srivatsa
Published in:
CoRR (2024)
Keyphrases
expert systems
machine learning
knowledge base
image processing
website
production line