Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU.
Fuxun Yu
Shawn Bray
Di Wang
Longfei Shangguan
Xulong Tang
Chenchen Liu
Xiang Chen
Published in:
CoRR (2021)
Keyphrases
multi tenant
scheduling problem
real time
data center
parallel machines
databases
information systems
low cost