Login / Signup
GMM: An Efficient GPU Memory Management-based Model Serving System for Multiple DNN Inference Models.
XinYu Piao
Jong-Kook Kim
Published in:
ICPP (2024)
Keyphrases
</>
probabilistic model
real time
management system
databases
data structure
low cost
variational bayes