Reconciling High Accuracy, Cost-Efficiency, and Low Latency of Inference Serving Systems.
Mehran Salmani
Saeid Ghafouri
Alireza Sanaee
Kamran Razavi
Max Mühlhäuser
Joseph Doyle
Pooyan Jamshidi
Mohsen Sharifi
Published in:
EuroMLSys@EuroSys (2023)
Keyphrases
low latency
high accuracy
highly efficient
high throughput
real time
distributed systems
data sets
database systems
high bandwidth