Archit Patke
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 13
Top Topics
Cloud Platform
Service Requirements
Online Learning
Network Congestion
Top Venues
CoRR
USENIX ATC
WOSC@Middleware
IPDPS Workshops
Publications
Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction.
CoRR
(2024)
Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar, Ravi K. Iyer
FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms.
MLSys
(2024)
Archit Patke, Dhemath Reddy, Saurabh Jha, Haoran Qiu, Christian Pinto, Shengkun Cui, Chandra Narayanaswami, Zbigniew Kalbarczyk, Ravishankar K. Iyer
One Queue Is All You Need: Resolving Head-of-Line Blocking in Large Language Model Serving.
CoRR
(2024)
Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
Power-aware Deep Learning Model Serving with μ-Serve.
USENIX ATC
(2024)
Haoran Qiu, Weichao Mao, Archit Patke, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
Reinforcement learning for resource management in multi-tenant serverless platforms.
EuroMLSys@EuroSys
(2022)
Archit Patke, Haoran Qiu, Saurabh Jha, Srikumar Venugopal, Michele Gazzetti, Christian Pinto, Zbigniew Kalbarczyk, Ravishankar K. Iyer
Evaluating Hardware Memory Disaggregation under Delay and Contention.
IPDPS Workshops
(2022)
Haoran Qiu, Weichao Mao, Archit Patke, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
SIMPPO: a scalable and incremental online learning framework for serverless resource management.
SoCC
(2022)
Haoran Qiu, Saurabh Jha, Subho S. Banerjee, Archit Patke, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Ravishankar K. Iyer
Is Function-as-a-Service a Good Fit for Latency-Critical Services?
WOSC@Middleware
(2021)
Archit Patke, Saurabh Jha, Haoran Qiu, Jim M. Brandt, Ann C. Gentile, Joe Greenseid, Zbigniew Kalbarczyk, Ravishankar K. Iyer
Delay sensitivity-driven congestion mitigation for HPC systems.
ICS
(2021)
Saurabh Jha, Archit Patke, Jim M. Brandt, Ann C. Gentile, Benjamin Lim, Mike Showerman, Greg Bauer, Larry Kaplan, Zbigniew Kalbarczyk, William Kramer, Ravi K. Iyer
Measuring Congestion in High-Performance Datacenter Interconnects.
NSDI
(2020)
Archit Patke, Saurabh Jha, Haoran Qiu, Jim M. Brandt, Ann C. Gentile, Joe Greenseid, Zbigniew Kalbarczyk, Ravishankar K. Iyer
Application-aware Congestion Mitigation for High-Performance Computing Systems.
CoRR
(2020)
Saurabh Jha, Archit Patke, Jim M. Brandt, Ann C. Gentile, Mike Showerman, Eric Roman, Zbigniew T. Kalbarczyk, William T. Kramer, Ravishankar K. Iyer
A Study of Network Congestion in Two Supercomputing High-Speed Interconnects.
CoRR
(2019)
Saurabh Jha, Archit Patke, Jim M. Brandt, Ann C. Gentile, Mike Showerman, Eric Roman, Zbigniew T. Kalbarczyk, Bill Kramer, Ravishankar K. Iyer
A Study of Network Congestion in Two Supercomputing High-Speed Interconnects.
Hot Interconnects
(2019)