Login / Signup

DeepLat: Achieving Minimum Worst Case Latency for DNN Inference with Batch-Aware Dispatching.

Jiaheng GaoYitao Hu
Published in: ICA3PP (1) (2023)
Keyphrases