DeInfer: A GPU resource allocation algorithm with spatial sharing for near-deterministic inferring tasks.

Yingwen Chen, Wenxin Li, Huan Zhou, Xiangrui Yang, Yanfei Yin
Published in: ICPP (2024)