Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.
Vihang P. Patil
Markus Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
Patrick M. Blies
Johannes Brandstetter
Jose A. Arjona-Medina
Sepp Hochreiter
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
learning algorithm
learning process
supervised learning
data mining
learning systems
learning scheme
real time
genetic algorithm
case study
learning tasks
learning problems
incremental learning
learning mechanism