A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning.
Tianchi Cai
Wenpeng Zhang
Lihong Gu
Xiaodong Zeng
Jinjie Gu
Published in:
CoRR (2021)
Keyphrases
reinforcement learning
optimal policy
partially observable
cost function
Markov decision processes
function approximation
data sets
least squares
reinforcement learning problems