HiRE: High Recall Approximate Top-k Estimation for Efficient LLM Inference.
Yashas Samaga
Varun Yerram
Chong You
Srinadh Bhojanapalli
Sanjiv Kumar
Prateek Jain
Praneeth Netrapalli
Published in:
CoRR (2024)
Keyphrases
high recall
high precision
data sets
cost effective
user defined
accurate estimation
neural network
machine learning
query processing
parameter estimation
probabilistic inference
estimation algorithm
efficient learning
inference process