Yuanying Cai
ORCID
Publication Activity (10 Years)
Years Active: 2020-2023
Publications (10 Years): 10
Top Topics
Temporal Difference Learning
Reinforcement Learning
Exploration Exploitation Tradeoff
Action Selection
Top Venues
CoRR
AAAI
AAMAS
ICDM
Publications
Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning.
CoRR
(2023)
Xiaowen Shi, Ze Wang, Yuanying Cai, Xiaoxu Wu, Fan Yang, Guogang Liao, Yongkang Wang, Xingxing Wang, Dong Wang
MDDL: A Framework for Reinforcement Learning-based Position Allocation in Multi-Channel Feed.
SIGIR
(2023)
Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning.
AAAI
(2023)
Xiaowen Shi, Ze Wang, Yuanying Cai, Xiaoxu Wu, Fan Yang, Guogang Liao, Yongkang Wang, Xingxing Wang, Dong Wang
MDDL: A Framework for Reinforcement Learning-based Position Allocation in Multi-Channel Feed.
CoRR
(2023)
Yuanying Cai, Chuheng Zhang, Hanye Zhao, Li Zhao, Jiang Bian
Curriculum Offline Reinforcement Learning.
AAMAS
(2023)
Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.
ICDM
(2022)
Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets.
CoRR
(2022)
Yuanying Cai, Chuheng Zhang, Wei Shen, Xiaonan He, Xuyun Zhang, Longbo Huang
Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations.
CIKM
(2022)
Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li
Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework.
AAAI
(2021)
Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li
Exploration by Maximizing Rényi Entropy for Zero-Shot Meta RL.
CoRR
(2020)