Efficient Entropy For Policy Gradient with Multi-Dimensional Action Space.
Yiming Zhang
Quan Ho Vuong
Kenny Song
Xiao-Yue Gong
Keith W. Ross
Published in:
ICLR (Workshop) (2018)
Keyphrases
action space
reinforcement learning
real-valued
policy gradient
machine learning
learning algorithm
cooperative
NP-hard
state space