Yujeong Choi
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 12
Top Topics
Scheduling Algorithm
Language Model
Processing Units
Precedence Constraints
Top Venues
CoRR
HPCA
ISCA
DAC
Publications
Yujeong Choi, Jiin Kim, Minsoo Rhu. ElasticRec: A Microservice-based Model Serving Architecture Enabling Elastic Resource Scaling for Recommendation Models. ISCA (2024)
Yujeong Choi, Jiin Kim, Minsoo Rhu. ElasticRec: A Microservice-based Model Serving Architecture Enabling Elastic Resource Scaling for Recommendation Models. CoRR (2024)
Yujeong Choi, John Kim, Minsoo Rhu. Hera: A Heterogeneity-Aware Multi-Tenant Inference Server for Personalized Recommendations. CoRR (2023)
Jehyeon Bang, Yujeong Choi, Myeongwoo Kim, Yongdeok Kim, Minsoo Rhu. vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training. CoRR (2023)
Yunseong Kim, Yujeong Choi, Minsoo Rhu. PARIS and ELSA: An Elastic Scheduling Algorithm for Reconfigurable Multi-GPU Inference Servers. CoRR (2022)
Yunseong Kim, Yujeong Choi, Minsoo Rhu. PARIS and ELSA: an elastic scheduling algorithm for reconfigurable multi-GPU inference servers. DAC (2022)
Yujeong Choi, Yunseong Kim, Minsoo Rhu. Lazy Batching: An SLA-aware Batching System for Cloud Machine Learning Inference. HPCA (2021)
Bongjoon Hyun, Youngeun Kwon, Yujeong Choi, John Kim, Minsoo Rhu. NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units. ASPLOS (2020)
Yujeong Choi, Yunseong Kim, Minsoo Rhu. LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference. CoRR (2020)
Yujeong Choi, Minsoo Rhu. PREMA: A Predictive Multi-Task Scheduling Algorithm For Preemptible Neural Processing Units. HPCA (2020)
Bongjoon Hyun, Youngeun Kwon, Yujeong Choi, John Kim, Minsoo Rhu. NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units. CoRR (2019)
Yujeong Choi, Minsoo Rhu. PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units. CoRR (2019)