Login / Signup
Qisong Yang
ORCID
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 9
Top Topics
Active Exploration
Temporal Difference
Model Free
Reinforcement Learning
Top Venues
AAAI
CoRR
Mach. Learn.
ITSC
</>
Publications
</>
Ruining Zhang
,
Haoran Han
,
Maolong Lv
,
Qisong Yang
,
Jian Cheng
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System.
AAAI
(2024)
Qisong Yang
,
Thiago D. Simão
,
Nils Jansen
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
Reinforcement Learning by Guided Safe Exploration.
CoRR
(2023)
Qisong Yang
,
Thiago D. Simão
,
Nils Jansen
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
Reinforcement Learning by Guided Safe Exploration.
ECAI
(2023)
Ruining Zhang
,
Haoran Han
,
Maolong Lv
,
Qisong Yang
,
Jian Cheng
Analyzing Generalization in Policy Networks: A Case Study with the Double-Integrator System.
CoRR
(2023)
Qisong Yang
,
Thiago D. Simão
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
Safety-constrained reinforcement learning with a distributional safety critic.
Mach. Learn.
112 (3) (2023)
Yueqi Hou
,
Xiaolong Liang
,
Maolong Lv
,
Qisong Yang
,
Yang Li
Subtask-masked curriculum learning for reinforcement learning with application to UAV maneuver decision-making.
Eng. Appl. Artif. Intell.
125 (2023)
Qisong Yang
,
Matthijs T. J. Spaan
CEM: Constrained Entropy Maximization for Task-Agnostic Safe Exploration.
AAAI
(2023)
Danial Kamran
,
Thiago D. Simão
,
Qisong Yang
,
Canmanie T. Ponnambalam
,
Johannes Fischer
,
Matthijs T. J. Spaan
,
Martin Lauer
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning.
ITSC
(2022)
Qisong Yang
,
Thiago D. Simão
,
Simon H. Tindemans
,
Matthijs T. J. Spaan
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning.
AAAI
(2021)