Login / Signup

GMM: An Efficient GPU Memory Management-based Model Serving System for Multiple DNN Inference Models.

XinYu PiaoJong-Kook Kim
Published in: ICPP (2024)
Keyphrases
  • probabilistic model
  • real time
  • management system
  • databases
  • data structure
  • low cost
  • variational bayes