Sign in

Towards Optimal Preemptive GPU Time-Sharing for Edge Model Serving.

Zhengxu XiaYitian HaoJun DuanChen WangJunchen Jiang
Published in: WOC@Middleware (2023)
Keyphrases