Login / Signup

InferLine: latency-aware provisioning and scaling for prediction serving pipelines.

Daniel CrankshawGur-Eyal SelaXiangxi MoCorey ZumarIon StoicaJoseph GonzalezAlexey Tumanov
Published in: SoCC (2020)
Keyphrases
  • prediction accuracy
  • web prefetching
  • quality of service
  • prediction model
  • prediction error
  • cloud computing
  • response time
  • prefetching
  • website
  • telecommunication services