Login / Signup
Leveraging class abstraction for commonsense reinforcement learning via residual policy gradient methods.
Niklas Höpner
Ilaria Tiddi
Herke van Hoof
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
policy gradient methods
natural actor critic
policy gradient
multi agent
function approximation
markov chain
actor critic
machine learning
monte carlo
markov decision processes
function approximators
gradient method