Login / Signup
Jiafan He
ORCID
Publication Activity (10 Years)
Years Active: 2017-2024
Publications (10 Years): 55
Top Topics
Function Approximation
Markov Decision Process
Reinforcement Learning
Regret Bounds
Top Venues
CoRR
ICML
NeurIPS
ASCC
</>
Publications
</>
Qiwei Di
,
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.
CoRR
(2024)
Weitong Zhang
,
Zhiyuan Fan
,
Jiafan He
,
Quanquan Gu
Settling Constant Regrets in Linear Markov Decision Processes.
CoRR
(2024)
Qiwei Di
,
Jiafan He
,
Quanquan Gu
Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback.
CoRR
(2024)
Zhihao Zhu
,
Jiafan He
,
Luyang Hou
,
Lianming Xu
,
Wendi Zhu
,
Li Wang
Emergency Localization for Mobile Ground Users: An Adaptive UAV Trajectory Planning Method.
INFOCOM (Workshops)
(2024)
Kaixuan Ji
,
Qingyue Zhao
,
Jiafan He
,
Weitong Zhang
,
Quanquan Gu
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs.
ICLR
(2024)
Qiwei Di
,
Heyang Zhao
,
Jiafan He
,
Quanquan Gu
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning.
ICLR
(2024)
Kaixuan Ji
,
Jiafan He
,
Quanquan Gu
Reinforcement Learning from Human Feedback with Active Queries.
CoRR
(2024)
Zhihao Zhu
,
Jiafan He
,
Luyang Hou
,
Lianming Xu
,
Wendi Zhu
,
Li Wang
Emergency Localization for Mobile Ground Users: An Adaptive UAV Trajectory Planning Method.
CoRR
(2024)
Jie Wang
,
Jie Yang
,
Jiafan He
,
Dongliang Peng
Multi-Augmentation-Based Contrastive Learning for Semi-Supervised Learning.
Algorithms
17 (3) (2024)
Chenlu Ye
,
Jiafan He
,
Quanquan Gu
,
Tong Zhang
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption.
CoRR
(2024)
Jiafan He
,
Heyang Zhao
,
Dongruo Zhou
,
Quanquan Gu
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes.
ICML
(2023)
Heyang Zhao
,
Jiafan He
,
Quanquan Gu
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation.
CoRR
(2023)
Heyang Zhao
,
Jiafan He
,
Dongruo Zhou
,
Tong Zhang
,
Quanquan Gu
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency.
CoRR
(2023)
Qiwei Di
,
Heyang Zhao
,
Jiafan He
,
Quanquan Gu
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning.
CoRR
(2023)
Weitong Zhang
,
Jiafan He
,
Zhiyuan Fan
,
Quanquan Gu
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits.
CoRR
(2023)
Yue Wu
,
Jiafan He
,
Quanquan Gu
Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension.
UAI
(2023)
Weitong Zhang
,
Jiafan He
,
Dongruo Zhou
,
Amy Zhang
,
Quanquan Gu
Provably efficient representation selection in Low-rank Markov Decision Processes: from online to offline RL.
UAI
(2023)
Heyang Zhao
,
Jiafan He
,
Dongruo Zhou
,
Tong Zhang
,
Quanquan Gu
Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency.
COLT
(2023)
Kaixuan Ji
,
Qingyue Zhao
,
Jiafan He
,
Weitong Zhang
,
Quanquan Gu
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs.
CoRR
(2023)
Jiafan He
,
Aiguo Fei
,
Qingwei Li
,
Feng Fang
Attitude Synchronization of Heterogenous Flexible Spacecrafts by Measurement-Based Feedback With Disturbance Suppression.
IEEE Access
11 (2023)
Yue Wu
,
Jiafan He
,
Quanquan Gu
Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension.
CoRR
(2023)
Qiwei Di
,
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path.
ICML
(2023)
Heyang Zhao
,
Dongruo Zhou
,
Jiafan He
,
Quanquan Gu
Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits.
ICML
(2023)
Yifei Min
,
Jiafan He
,
Tianhao Wang
,
Quanquan Gu
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation.
ICML
(2023)
Weitong Zhang
,
Jiafan He
,
Zhiyuan Fan
,
Quanquan Gu
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits.
ICML
(2023)
Yifei Min
,
Jiafan He
,
Tianhao Wang
,
Quanquan Gu
Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation.
CoRR
(2023)
Heyang Zhao
,
Dongruo Zhou
,
Jiafan He
,
Quanquan Gu
Bandit Learning with General Function Classes: Heteroscedastic Noise and Variance-dependent Regret Bounds.
CoRR
(2022)
Jiafan He
,
Dongruo Zhou
,
Tong Zhang
,
Quanquan Gu
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions.
NeurIPS
(2022)
Jiafan He
,
Tianhao Wang
,
Yifei Min
,
Quanquan Gu
A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits.
CoRR
(2022)
Jiafan He
,
Heyang Zhao
,
Dongruo Zhou
,
Quanquan Gu
Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes.
CoRR
(2022)
Yiming Mao
,
Zhijie Xia
,
Qingwei Li
,
Jiafan He
,
Aiguo Fei
Accurate Decision-Making Method for Air Combat Pilots Based on Data-Driven.
DMBD (2)
(2022)
Chonghua Liao
,
Jiafan He
,
Quanquan Gu
Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes.
ACML
(2022)
Yifei Min
,
Jiafan He
,
Tianhao Wang
,
Quanquan Gu
Learning Stochastic Shortest Path with Linear Function Approximation.
ICML
(2022)
Jiafan He
,
Dongruo Zhou
,
Tong Zhang
,
Quanquan Gu
Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions.
CoRR
(2022)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs.
AISTATS
(2022)
Yuanzhou Chen
,
Jiafan He
,
Quanquan Gu
On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs.
ICML
(2022)
Jiaqi Wang
,
Wei Xing Zheng
,
Andong Sheng
,
Jiafan He
Cooperative Global Robust Practical Output Regulation of Nonlinear Lower Triangular Multiagent Systems via Event-Triggered Control.
IEEE Trans. Cybern.
52 (7) (2022)
Jiafan He
,
Tianhao Wang
,
Yifei Min
,
Quanquan Gu
A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits.
NeurIPS
(2022)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation.
CoRR
(2021)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation.
NeurIPS
(2021)
Chonghua Liao
,
Jiafan He
,
Quanquan Gu
Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes.
CoRR
(2021)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs.
NeurIPS
(2021)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Logarithmic Regret for Reinforcement Learning with Linear Function Approximation.
ICML
(2021)
Weitong Zhang
,
Jiafan He
,
Dongruo Zhou
,
Amy Zhang
,
Quanquan Gu
Provably Efficient Representation Learning in Low-rank Markov Decision Processes.
CoRR
(2021)
Dongruo Zhou
,
Jiafan He
,
Quanquan Gu
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping.
ICML
(2021)
Yifei Min
,
Jiafan He
,
Tianhao Wang
,
Quanquan Gu
Learning Stochastic Shortest Path with Linear Function Approximation.
CoRR
(2021)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation.
CoRR
(2021)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Logarithmic Regret for Reinforcement Learning with Linear Function Approximation.
CoRR
(2020)
Dongruo Zhou
,
Jiafan He
,
Quanquan Gu
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping.
CoRR
(2020)
Jiafan He
,
Dongruo Zhou
,
Quanquan Gu
Minimax Optimal Reinforcement Learning for Discounted MDPs.
CoRR
(2020)
Jiafan He
,
Ariel D. Procaccia
,
Alexandros Psomas
,
David Zeng
Achieving a Fairer Future by Changing the Past.
IJCAI
(2019)
Jiafan He
,
Youfeng Su
,
Dabo Xu
,
Andong Sheng
Event-Triggered Attitude Regulation of Rigid Spacecraft with Uncertain Inertia Matrix.
ASCC
(2019)
Jiafan He
,
Andong Sheng
,
Dabo Xu
Robust Attitude Regulation of Uncertain Spacecraft with Flexible Appendages.
ICNSC
(2019)
Pengpeng Ye
,
Jiafan He
,
Yinya Li
,
Guoqing Qi
,
Andong Sheng
Rectangular Impulsive Consensus of Multi-agent Systems with Heterogeneous Control Widths.
ASCC
(2019)
Dabo Xu
,
Jiafan He
,
Andong Sheng
,
Zhiyong Chen
,
Dan Wang
Robust attitude tracking control of a rigid spacecraft based on nonlinearly controlled quaternions.
ASCC
(2017)