Login / Signup
Yujeong Choi
Publication Activity (10 Years)
Years Active: 2019-2023
Publications (10 Years): 10
Top Topics
Precedence Constraints
Scheduling Algorithm
Processing Units
Low Overhead
Top Venues
CoRR
HPCA
DAC
ASPLOS
</>
Publications
</>
Yujeong Choi
,
John Kim
,
Minsoo Rhu
Hera: A Heterogeneity-Aware Multi-Tenant Inference Server for Personalized Recommendations.
CoRR
(2023)
Jehyeon Bang
,
Yujeong Choi
,
Myeongwoo Kim
,
Yongdeok Kim
,
Minsoo Rhu
vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training.
CoRR
(2023)
Yunseong Kim
,
Yujeong Choi
,
Minsoo Rhu
PARIS and ELSA: An Elastic Scheduling Algorithm for Reconfigurable Multi-GPU Inference Servers.
CoRR
(2022)
Yunseong Kim
,
Yujeong Choi
,
Minsoo Rhu
PARIS and ELSA: an elastic scheduling algorithm for reconfigurable multi-GPU inference servers.
DAC
(2022)
Yujeong Choi
,
Yunseong Kim
,
Minsoo Rhu
Lazy Batching: An SLA-aware Batching System for Cloud Machine Learning Inference.
HPCA
(2021)
Bongjoon Hyun
,
Youngeun Kwon
,
Yujeong Choi
,
John Kim
,
Minsoo Rhu
NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units.
ASPLOS
(2020)
Yujeong Choi
,
Yunseong Kim
,
Minsoo Rhu
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference.
CoRR
(2020)
Yujeong Choi
,
Minsoo Rhu
PREMA: A Predictive Multi-Task Scheduling Algorithm For Preemptible Neural Processing Units.
HPCA
(2020)
Bongjoon Hyun
,
Youngeun Kwon
,
Yujeong Choi
,
John Kim
,
Minsoo Rhu
NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units.
CoRR
(2019)
Yujeong Choi
,
Minsoo Rhu
PREMA: A Predictive Multi-task Scheduling Algorithm For Preemptible Neural Processing Units.
CoRR
(2019)