C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU.
Fuxun Yu
Shawn Bray
Di Wang
Longfei Shangguan
Xulong Tang
Chenchen Liu
Xiang Chen
Published in:
ICCAD (2021)
Keyphrases
</>
multi tenant
scheduling problem
data center
real time
parallel computation
parallel computing
parallel machines
information systems
relational databases
data sources