One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-off in Machine Learning Cloud Service APIs via Tolerance Tiers.
Matthew HalpernBehzad BoroujerdianTodd W. MummertEvelyn DuesterwaldVijay Janapa ReddiPublished in: CoRR (2019)