Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU.

Fuxun Yu, Shawn Bray, Di Wang, Longfei Shangguan, Xulong Tang, Chenchen Liu, Xiang Chen
Published in: ICCAD (2021)
Keyphrases
  • multi tenant
  • scheduling problem
  • data center
  • real time
  • parallel computation
  • parallel computing
  • parallel machines
  • information systems
  • relational databases
  • data sources