Login / Signup
SOAC: The Soft Option Actor-Critic Architecture.
Chenghao Li
Xiaoteng Ma
Chongjie Zhang
Jun Yang
Li Xia
Qianchuan Zhao
Published in:
CoRR (2020)
Keyphrases
</>
actor critic
approximate dynamic programming
reinforcement learning
optimal control
temporal difference
gradient method
supervised learning
function approximation
reinforcement learning algorithms