Sign in

Performance Efficient Layer-aware DNN Inference Task Scheduling in GPU Cluster.

Hongmin GengDeze ZengYuepeng Li
Published in: GLOBECOM (2022)
Keyphrases