• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU.

Fuxun YuShawn BrayDi WangLongfei ShangguanXulong TangChenchen LiuXiang Chen
Published in: ICCAD (2021)
Keyphrases
  • multi tenant
  • scheduling problem
  • data center
  • real time
  • parallel computation
  • parallel computing
  • parallel machines
  • information systems
  • relational databases
  • data sources