Actor Loss of Soft Actor Critic Explained.
Thibault Lahire
Published in:
CoRR (2021)
Keyphrases
actor critic
reinforcement learning
optimal control
policy gradient
temporal difference
neuro fuzzy
approximate dynamic programming
gradient method
function approximation
reinforcement learning algorithms
policy iteration
average reward
neural network
sparse representation
dynamical systems