One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-Off in Machine Learning Cloud Service APIs via Tolerance Tiers.
Matthew HalpernBehzad BoroujerdianTodd W. MummertEvelyn DuesterwaldVijay Janapa ReddiPublished in: ISPASS (2019)