Runzhe Wu
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 14
Top Topics
Policy Evaluation
Partially Observable Markov
Imitation Learning
Markov Decision Chains
Top Venues
CoRR
NeurIPS
J. Mach. Learn. Res.
ICLR
Publications
Runzhe Wu, Ayush Sekhari, Akshay Krishnamurthy, Wen Sun
Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics.
CoRR
(2024)
Runzhe Wu, Wen Sun
Making RL with Preference-based Feedback Efficient via Randomization.
ICLR
(2024)
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu
Contextual Bandits and Imitation Learning via Preference-Based Active Queries.
CoRR
(2023)
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu
Contextual Bandits and Imitation Learning with Preference-Based Active Queries.
NeurIPS
(2023)
Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning.
CoRR
(2023)
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu
Selective Sampling and Imitation Learning via Online Regression.
CoRR
(2023)
Runzhe Wu, Wen Sun
Making RL with Preference-based Feedback Efficient via Randomization.
CoRR
(2023)
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
J. Mach. Learn. Res.
24 (2023)
Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning.
NeurIPS
(2023)
Runzhe Wu, Masatoshi Uehara, Wen Sun
Distributional Offline Policy Evaluation with Predictive Error Guarantees.
CoRR
(2023)
Ayush Sekhari, Karthik Sridharan, Wen Sun, Runzhe Wu
Selective Sampling and Imitation Learning via Online Regression.
NeurIPS
(2023)
Runzhe Wu, Masatoshi Uehara, Wen Sun
Distributional Offline Policy Evaluation with Predictive Error Guarantees.
ICML
(2023)
Runzhe Wu, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang
Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration.
NeurIPS
(2021)
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning.
CoRR
(2021)