Paella: Low-latency Model Serving with Software-defined GPU Scheduling.
Kelvin K. W. Ng
Henri Maxime Demoulin
Vincent Liu
Published in:
SOSP (2023)
Keyphrases
low latency
real time
relational databases
end to end