Login / Signup
The Point to Which Soft Actor-Critic Converges.
Jianfei Ma
Published in:
CoRR (2023)
Keyphrases
</>
actor critic
reinforcement learning
temporal difference
approximate dynamic programming
policy gradient
optimal control
function approximation