Han Zhang
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 3
Top Topics
State Dependent
Optimal Policy
Multistage
Infinite Horizon
Top Venues
CoRR
ICLR
Publications
Han Zhang, Yu Lei, Lin Gui, Min Yang, Yulan He, Hui Wang, Ruifeng Xu
CPPO: Continual Learning for Reinforcement Learning with Human Feedback.
ICLR (2024)
Han Zhang, Lin Gui, Yu Lei, Yuanzhao Zhai, Yehong Zhang, Yulan He, Hui Wang, Yue Yu, Kam-Fai Wong, Bin Liang, Ruifeng Xu
COPR: Continual Human Preference Learning via Optimal Policy Regularization.
CoRR (2024)
Han Zhang, Lin Gui, Yuanzhao Zhai, Hui Wang, Yu Lei, Ruifeng Xu
COPF: Continual Learning Human Preference through Optimal Policy Fitting.
CoRR (2023)