Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling.
Jesus Bujalance Martin
Raphaël Chekroun
Fabien Moutarde
Published in:
CoRR (2021)
Keyphrases
reinforcement learning
actor critic
learning process
policy gradient
learning algorithm
learning tasks
machine learning
active learning
machine learning algorithms
optimal control