Sign in
Han Zhong
Publication Activity (10 Years)
Years Active: 2022-2023
Publications (10 Years): 11
Top Topics
Nash Equilibria
Reinforcement Learning
Tile Coding
Markov Decision Process
Top Venues
NeurIPS
CoRR
ICLR
J. Mach. Learn. Res.
</>
Publications
</>
Zhihan Liu
,
Miao Lu
,
Wei Xiong
,
Han Zhong
,
Hao Hu
,
Shenao Zhang
,
Sirui Zheng
,
Zhuoran Yang
,
Zhaoran Wang
One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration.
CoRR
(2023)
Yunchang Yang
,
Han Zhong
,
Tianhao Wu
,
Bin Liu
,
Liwei Wang
,
Simon S. Du
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback.
NeurIPS
(2023)
Jiayi Huang
,
Han Zhong
,
Liwei Wang
,
Lin F. Yang
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds.
CoRR
(2023)
Jiayi Huang
,
Han Zhong
,
Liwei Wang
,
Lin Yang
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds.
NeurIPS
(2023)
Wei Xiong
,
Han Zhong
,
Chengshuai Shi
,
Cong Shen
,
Liwei Wang
,
Tong Zhang
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game.
ICLR
(2023)
Jiachen Hu
,
Han Zhong
,
Chi Jin
,
Liwei Wang
Provable Sim-to-real Transfer in Continuous Domain with Partial Observations.
ICLR
(2023)
Shuang Qiu
,
Ziyu Dai
,
Han Zhong
,
Zhaoran Wang
,
Zhuoran Yang
,
Tong Zhang
Posterior Sampling for Competitive RL: Function Approximation and Partial Observation.
NeurIPS
(2023)
Han Zhong
,
Zhuoran Yang
,
Zhaoran Wang
,
Michael I. Jordan
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopically Rational Followers?
J. Mach. Learn. Res.
24 (2023)
Han Zhong
,
Tong Zhang
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes.
NeurIPS
(2023)
Jiayi Huang
,
Han Zhong
,
Liwei Wang
,
Lin F. Yang
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation.
CoRR
(2023)
Binghui Li
,
Jikai Jin
,
Han Zhong
,
John E. Hopcroft
,
Liwei Wang
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power.
NeurIPS
(2022)