Login / Signup
State Representation Learning for Minimax Deep Deterministic Policy Gradient.
Dapeng Hu
Xuesong Jiang
Xiumei Wei
Jian Wang
Published in:
KSEM (1) (2019)
Keyphrases
</>
policy gradient
learning process
actor critic
reinforcement learning
search algorithm
learning tasks
mobile robot
supervised learning
state action