Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling.

Jesus Bujalance Martin Raphaël Chekroun Fabien Moutarde

Published in: CoRR (2021)

Keyphrases

reinforcement learning
actor critic
learning process
policy gradient
learning algorithm
learning tasks
machine learning
active learning
machine learning algorithms
optimal control