Han Zhong
Publication Activity (10 Years)
Years Active: 2013-2024
Publications (10 Years): 45
Top Topics
Partial Observations
Nash Equilibria
Reinforcement Learning
Function Approximation
Top Venues
CoRR
ICML
NeurIPS
Concurr. Comput. Pract. Exp.
Publications
Rui Yang
,
Xiaoman Pan
,
Feng Luo
,
Shuang Qiu
,
Han Zhong
,
Dong Yu
,
Jianshu Chen
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment.
CoRR
(2024)
Jose H. Blanchet
,
Miao Lu
,
Tong Zhang
,
Han Zhong
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage.
NeurIPS
(2023)
Shuang Qiu
,
Ziyu Dai
,
Han Zhong
,
Zhaoran Wang
,
Zhuoran Yang
,
Tong Zhang
Posterior Sampling for Competitive RL: Function Approximation and Partial Observation.
CoRR
(2023)
Wei Xiong
,
Hanze Dong
,
Chenlu Ye
,
Han Zhong
,
Nan Jiang
,
Tong Zhang
Gibbs Sampling from Human Feedback: A Provable KL-constrained Framework for RLHF.
CoRR
(2023)
Jose Blanchet
,
Miao Lu
,
Tong Zhang
,
Han Zhong
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage.
CoRR
(2023)
Han Zhong
,
Jiachen Hu
,
Yecheng Xue
,
Tongyang Li
,
Liwei Wang
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret.
CoRR
(2023)
Rui Yang
,
Han Zhong
,
Jiawei Xu
,
Amy Zhang
,
Chongjie Zhang
,
Lei Han
,
Tong Zhang
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption.
CoRR
(2023)
Han Zhong
,
Tong Zhang
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes.
CoRR
(2023)
Guhao Feng
,
Han Zhong
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity.
CoRR
(2023)
Zhihan Liu
,
Miao Lu
,
Wei Xiong
,
Han Zhong
,
Hao Hu
,
Shenao Zhang
,
Sirui Zheng
,
Zhuoran Yang
,
Zhaoran Wang
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration.
NeurIPS
(2023)
Yunchang Yang
,
Han Zhong
,
Tianhao Wu
,
Bin Liu
,
Liwei Wang
,
Simon S. Du
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback.
CoRR
(2023)
Zhenci Sun
,
Xiaoguang Zhao
,
Lingyun Zhang
,
Ziqi Mei
,
Han Zhong
,
Rui You
,
Wenshuai Lu
,
Zheng You
,
Jiahao Zhao
WiFi Energy-Harvesting Antenna Inspired by the Resonant Magnetic Dipole Metamaterial.
Sensors
22 (17) (2022)
Jiachen Hu
,
Han Zhong
,
Chi Jin
,
Liwei Wang
Provable Sim-to-real Transfer in Continuous Domain with Partial Observations.
CoRR
(2022)
Han Zhong
,
Zheng Li
,
Peng Chen
,
Hao Lu
,
Yijia Xu
The selection of burglary cases based on multidimensional features and PageRank.
Concurr. Comput. Pract. Exp.
34 (10) (2022)
Han Zhong
,
Jin Wu
Image dehazing algorithm based on improved generative adversarial network.
ICCSIE
(2022)
Wei Xiong
,
Han Zhong
,
Chengshuai Shi
,
Cong Shen
,
Liwei Wang
,
Tong Zhang
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game.
CoRR
(2022)
Wei Xiong
,
Han Zhong
,
Chengshuai Shi
,
Cong Shen
,
Tong Zhang
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games.
CoRR
(2022)
Han Zhong
,
Zhenhu Ning
,
Guijun Li
,
Zheng Li
A method of core concept extraction based on semantic-weight ranking.
Concurr. Comput. Pract. Exp.
34 (1) (2022)
Han Zhong
,
Wei Xiong
,
Jiyuan Tan
,
Liwei Wang
,
Tong Zhang
,
Zhaoran Wang
,
Zhuoran Yang
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets.
CoRR
(2022)
Wei Xiong
,
Han Zhong
,
Chengshuai Shi
,
Cong Shen
,
Tong Zhang
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games.
ICML
(2022)
Yunchang Yang
,
Tianhao Wu
,
Han Zhong
,
Evrard Garcelon
,
Matteo Pirotta
,
Alessandro Lazaric
,
Liwei Wang
,
Simon Shaolei Du
A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning.
ICLR
(2022)
Han Zhong
,
Wei Xiong
,
Jiyuan Tan
,
Liwei Wang
,
Tong Zhang
,
Zhaoran Wang
,
Zhuoran Yang
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets.
ICML
(2022)
Han Zhong
,
Wei Xiong
,
Sirui Zheng
,
Liwei Wang
,
Zhaoran Wang
,
Zhuoran Yang
,
Tong Zhang
A Posterior Sampling Framework for Interactive Decision Making.
CoRR
(2022)
Xiaoyu Chen
,
Han Zhong
,
Zhuoran Yang
,
Zhaoran Wang
,
Liwei Wang
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation.
ICML
(2022)
Binghui Li
,
Jikai Jin
,
Han Zhong
,
John E. Hopcroft
,
Liwei Wang
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power.
CoRR
(2022)
Tianhao Wu
,
Yunchang Yang
,
Han Zhong
,
Liwei Wang
,
Simon S. Du
,
Jiantao Jiao
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee.
ICML
(2022)
Xiaoyu Chen
,
Han Zhong
,
Zhuoran Yang
,
Zhaoran Wang
,
Liwei Wang
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation.
CoRR
(2022)
Han Zhong
,
Jiayi Huang
,
Lin F. Yang
,
Liwei Wang
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs.
CoRR
(2021)
Yunchang Yang
,
Tianhao Wu
,
Han Zhong
,
Evrard Garcelon
,
Matteo Pirotta
,
Alessandro Lazaric
,
Liwei Wang
,
Simon S. Du
A Unified Framework for Conservative Exploration.
CoRR
(2021)
Runzhou Zhang
,
Han Zhong
,
Tongyi Zheng
,
Lei Ning
Trajectory Mining-Based City-Level Mobility Model for 5G NB-IoT Networks.
Wirel. Commun. Mob. Comput.
2021 (2021)
Tianhao Wu
,
Yunchang Yang
,
Han Zhong
,
Liwei Wang
,
Simon S. Du
,
Jiantao Jiao
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee.
CoRR
(2021)
Han Zhong
,
Zhuoran Yang
,
Zhaoran Wang
,
Csaba Szepesvári
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs.
CoRR
(2021)
Han Zhong
,
Shiqiang Zhang
,
Jianli Liu
A research framework for constructing the knowledge database of public security information.
Int. J. Wirel. Mob. Comput.
21 (3) (2021)
Han Zhong
,
Zhuoran Yang
,
Zhaoran Wang
,
Michael I. Jordan
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers?
CoRR
(2021)
Han Zhong
,
Ruize Sun
,
Fengcheng Mei
,
Yong Chen
,
Fan Jin
,
Lei Ning
Deep Grid Scheduler for 5G NB-IoT Uplink Transmission.
Secur. Commun. Networks
2021 (2021)
Han Zhong
,
Jiayi Huang
,
Lin Yang
,
Liwei Wang
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs.
NeurIPS
(2021)
Han Zhong
,
Hong Liu
,
Geqi Qi
Analysis of Terminal Area Airspace Operation Status Based on Trajectory Characteristic Point Clustering.
IEEE Access
9 (2021)
Kaixin Chen
,
Xiao Lin
,
Xing Hu
,
Jiayao Wang
,
Han Zhong
,
Linhua Jiang
An enhanced adaptive non-local means algorithm for Rician noise reduction in magnetic resonance brain images.
BMC Medical Imaging
20 (1) (2020)
Jianhui Chen
,
Ningning Wang
,
Yue Deng
,
Han Zhong
,
Jian Han
,
Youjun Li
,
Zhijiang Wan
,
Taihei Kotake
,
Dongsheng Wang
,
Ning Zhong
Wisdom as a Service for Mental Health Care.
IEEE Trans. Cloud Comput.
8 (2) (2020)
Han Zhong
,
Ethan X. Fang
,
Zhuoran Yang
,
Zhaoran Wang
Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy.
CoRR
(2020)
Han Zhong
,
Zhisheng Huang
Document recommendation based on interests of co-authors for brain science.
Health Inf. Sci. Syst.
7 (1) (2019)
Han Zhong
,
Geqi Qi
,
Wei Guan
,
Xiaochen Hua
Application of Non-Negative Tensor Factorization for Airport Flight Delay Pattern Recognition.
IEEE Access
7 (2019)
Han Zhong
,
Zhisheng Huang
Document Recommendation Based on Interests of Co-authors for Brain Science.
HIS
(2019)
Haiyang Yu
,
Yongquan Cai
,
Shanshan Kong
,
Zhenhu Ning
,
Fei Xue
,
Han Zhong
Efficient and Secure Identity-Based Public Auditing for Dynamic Outsourced Data with Proxy.
KSII Trans. Internet Inf. Syst.
11 (10) (2017)
Ningning Wang
,
Ning Zhong
,
Jian Han
,
Jianhui Chen
,
Han Zhong
,
Taihei Kotake
,
Dongsheng Wang
,
Jianzhuo Yan
A Personalized Method of Literature Recommendation Based on Brain Informatics Provenances.
BIH
(2015)
Han Zhong
,
Jianhui Chen
,
Jian Han
,
Ning Zhong
Data-Brain Driven Documents Ranking for Constructing Brain Informatics Provenances.
Brain Informatics and Health
(2014)
Jian Han
,
Jianhui Chen
,
Han Zhong
,
Ning Zhong
A Brain Informatics Research Recommendation System.
Brain Informatics and Health
(2014)
Han Zhong
,
Jianhui Chen
,
Taihei Kotake
,
Jian Han
,
Ning Zhong
,
Zhisheng Huang
Developing a Brain Informatics Provenance Model.
Brain and Health Informatics
(2013)