Login / Signup
Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU.
Fuxun Yu
Shawn Bray
Di Wang
Longfei Shangguan
Xulong Tang
Chenchen Liu
Xiang Chen
Published in:
ICCAD (2021)
Keyphrases
</>
multi tenant
scheduling problem
data center
real time
parallel computation
parallel computing
parallel machines
information systems
relational databases
data sources