Login / Signup
Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems.
Mehran Salmani
Saeid Ghafouri
Alireza Sanaee
Kamran Razavi
Max Mühlhäuser
Joseph Doyle
Pooyan Jamshidi
Mohsen Sharifi
Published in:
CoRR (2023)
Keyphrases
</>
low latency
high accuracy
highly efficient
computational complexity
computer systems
database
multi dimensional
virtual machine
high bandwidth