​
Login / Signup
Minchen Yu
ORCID
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 10
Top Topics
Highly Ranked
Structured Data
Environmentally Friendly
Machine Learning
Top Venues
CoRR
USENIX Annual Technical Conference
ICDCS
IEEE Trans. Cloud Comput.
</>
Publications
</>
Suyi Li
,
Hanfeng Lu
,
Tianyuan Wu
,
Minchen Yu
,
Qizhen Weng
,
Xusheng Chen
,
Yizhou Shan
,
Binhang Yuan
,
Wei Wang
CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference.
CoRR
(2024)
Minchen Yu
,
Ao Wang
,
Dong Chen
,
Haoxuan Yu
,
Xiaonan Luo
,
Zhuohao Li
,
Wei Wang
,
Ruichuan Chen
,
Dapeng Nie
,
Haoran Yang
FaaSwap: SLO-Aware, GPU-Efficient Serverless Inference via Model Swapping.
CoRR
(2023)
Minchen Yu
,
Tingjia Cao
,
Wei Wang
,
Ruichuan Chen
Following the Data, Not the Function: Rethinking Function Orchestration in Serverless Computing.
NSDI
(2023)
Chengliang Zhang
,
Minchen Yu
,
Wei Wang
,
Feng Yan
Enabling Cost-Effective, SLO-Aware Machine Learning Inference Serving on Public Cloud.
IEEE Trans. Cloud Comput.
10 (3) (2022)
Minchen Yu
,
Tingjia Cao
,
Wei Wang
,
Ruichuan Chen
Restructuring Serverless Computing with Data-Centric Function Orchestration.
CoRR
(2021)
Minchen Yu
,
Zhifeng Jiang
,
Hok Chun Ng
,
Wei Wang
,
Ruichuan Chen
,
Bo Li
Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning.
ICDCS
(2021)
Huangshi Tian
,
Minchen Yu
,
Wei Wang
CrystalPerf: Learning to Characterize the Performance of Dataflow Computation through Code Analysis.
USENIX Annual Technical Conference
(2021)
Minchen Yu
,
Yinghao Yu
,
Yunchuan Zheng
,
Baichen Yang
,
Wei Wang
RepBun: Load-Balanced, Shuffle-Free Cluster Caching for Structured Data.
INFOCOM
(2020)
Chengliang Zhang
,
Minchen Yu
,
Wei Wang
,
Feng Yan
MArk: Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving.
USENIX Annual Technical Conference
(2019)
Huangshi Tian
,
Minchen Yu
,
Wei Wang
Continuum: A Platform for Cost-Aware, Low-Latency Continual Learning.
SoCC
(2018)