Sign in

FaaSwap: SLO-Aware, GPU-Efficient Serverless Inference via Model Swapping.

Minchen YuAo WangDong ChenHaoxuan YuXiaonan LuoZhuohao LiWei WangRuichuan ChenDapeng NieHaoran Yang
Published in: CoRR (2023)
Keyphrases