Weichao Mao
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 31
Top Topics
Resource Management
Matching Algorithm
Regret Bounds
Reinforcement Learning
Top Venues
CoRR
NeurIPS
CDC
L4DC
Publications
Xiangyuan Zhang, Weichao Mao, Haoran Qiu, Tamer Basar
Decision Transformer as a Foundation Model for Partially Observable Continuous Control.
CoRR
(2024)
Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction.
CoRR
(2024)
Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar, Ravi K. Iyer
FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms.
MLSys
(2024)
Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar
$\widetilde{O}(T^{-1})$ Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games.
L4DC
(2024)
Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar
$\widetilde{O}(T^{-1})$ Convergence to (Coarse) Correlated Equilibria in Full-Information General-Sum Markov Games.
CoRR
(2024)
Haoran Qiu, Weichao Mao, Archit Patke, Shengkun Cui, Saurabh Jha, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
Power-aware Deep Learning Model Serving with μ-Serve.
USENIX ATC
(2024)
Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Basar
Controlgym: Large-scale control environments for benchmarking reinforcement learning algorithms.
L4DC
(2024)
Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Tamer Basar
Multi-Agent Meta-Reinforcement Learning: Sharper Convergence Rates with Task Similarity.
NeurIPS
(2023)
Haoran Qiu, Weichao Mao, Chen Wang, Hubertus Franke, Alaa Youssef, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems.
USENIX ATC
(2023)
Weichao Mao, Ruta Desai, Michael Louis Iuzzolino, Nitin Kamra
Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks.
CoRR
(2023)
Xiangyuan Zhang, Weichao Mao, Saviz Mowlavi, Mouhacine Benosman, Tamer Basar
Controlgym: Large-Scale Safety-Critical Control Environments for Benchmarking Reinforcement Learning Algorithms.
CoRR
(2023)
Weichao Mao, Tamer Basar
Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games.
Dyn. Games Appl.
13 (1) (2023)
Weichao Mao, Lin Yang, Kaiqing Zhang, Tamer Basar
On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning.
ICML
(2022)
Haoran Qiu, Weichao Mao, Archit Patke, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
Reinforcement learning for resource management in multi-tenant serverless platforms.
EuroMLSys@EuroSys
(2022)
Weichao Mao, Haoran Qiu, Chen Wang, Hubertus Franke, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Tamer Basar
A Mean-Field Game Approach to Cloud Resource Management with Function Approximation.
NeurIPS
(2022)
Haoran Qiu, Weichao Mao, Archit Patke, Chen Wang, Hubertus Franke, Zbigniew T. Kalbarczyk, Tamer Basar, Ravishankar K. Iyer
SIMPPO: a scalable and incremental online learning framework for serverless resource management.
SoCC
(2022)
Weichao Mao, Tamer Basar
Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games.
CoRR
(2021)
Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Basar
Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs.
ICML
(2021)
Sujay Bhatt, Weichao Mao, Alec Koppel, Tamer Basar
Semiparametric Information State Embedding for Policy Search under Imperfect Information.
CDC
(2021)
Weichao Mao, Tamer Basar, Lin F. Yang, Kaiqing Zhang
Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration.
CoRR
(2021)
Weichao Mao, Kaiqing Zhang, Qiaomin Xie, Tamer Basar
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis.
NeurIPS
(2020)
Weichao Mao, Kaiqing Zhang, Erik Miehling, Tamer Basar
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning.
CoRR
(2020)
Weichao Mao, Kaiqing Zhang, Qiaomin Xie, Tamer Basar
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis.
CoRR
(2020)
Weichao Mao, Kaiqing Zhang, Ruihao Zhu, David Simchi-Levi, Tamer Basar
Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs.
CoRR
(2020)
Weichao Mao, Kaiqing Zhang, Erik Miehling, Tamer Basar
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning.
CDC
(2020)
Zhenzhe Zheng, Weichao Mao, Fan Wu, Guihai Chen
Challenges and Opportunities in IoT Data Markets.
SocialSens@CPSIoTWeek
(2019)
Shiyou Qian, Jian Cao, Weichao Mao, Yanmin Zhu, Jiadi Yu, Minglu Li, Jie Wang
A fast and anti-matchability matching algorithm for content-based publish/subscribe systems.
Comput. Networks
149 (2019)
Weichao Mao, Zhenzhe Zheng, Fan Wu
Pricing for Revenue Maximization in IoT Data Markets: An Information Design Perspective.
INFOCOM
(2019)
Shiyou Qian, Weichao Mao, Jian Cao, Frederic Le Mouel, Minglu Li
Adjusting Matching Algorithm to Adapt to Workload Fluctuations in Content-based Publish/Subscribe Systems.
INFOCOM
(2019)
Weichao Mao, Zhenzhe Zheng, Fan Wu, Guihai Chen
Online Pricing for Revenue Maximization with Unknown Time Discounting Valuations.
IJCAI
(2018)
Weichao Mao, Jian Cao, Guangtao Xue, Jiadi Yu, Yanmin Zhu, Minglu Li, Wenjuan Li, Shiyou Qian
Adjusting Matching Algorithm to Adapt to Dynamic Subscriptions in Content-Based Publish/Subscribe Systems.
ISPA/IUCC/BDCloud/SocialCom/SustainCom
(2018)