Login / Signup
DeInfer: A GPU resource allocation algorithm with spatial sharing for near-deterministic inferring tasks.
Yingwen Chen
Wenxin Li
Huan Zhou
Xiangrui Yang
Yanfei Yin
Published in:
ICPP (2024)
Keyphrases
</>
resource allocation
learning algorithm
dynamic programming
objective function
optimal solution
computational complexity
resource management
resource allocation problems
real time
resource sharing
np hard
linear programming