InferLine: latency-aware provisioning and scaling for prediction serving pipelines.

Published in: SoCC (2020)

Keyphrases