Login / Signup
Efficient Entropy for Policy Gradient with Multidimensional Action Space.
Yiming Zhang
Quan Ho Vuong
Kenny Song
Xiao-Yue Gong
Keith W. Ross
Published in:
CoRR (2018)
Keyphrases
</>
policy gradient
action space
state space
reinforcement learning
multi agent
single agent
optimal control