Login / Signup

Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems.

Mehran SalmaniSaeid GhafouriAlireza SanaeeKamran RazaviMax MühlhäuserJoseph DoylePooyan JamshidiMohsen Sharifi
Published in: CoRR (2023)
Keyphrases
  • low latency
  • high accuracy
  • highly efficient
  • computational complexity
  • computer systems
  • database
  • multi dimensional
  • virtual machine
  • high bandwidth