Login / Signup
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.
Vihang P. Patil
Markus Hofmarcher
Marius-Constantin Dinu
Matthias Dorfer
Patrick M. Blies
Johannes Brandstetter
José Antonio Arjona-Medina
Sepp Hochreiter
Published in:
ICML (2022)
Keyphrases
</>
reinforcement learning
learning process
learning systems
learning algorithm
active learning
knowledge acquisition
learning tasks
incremental learning
database
learning scheme
data sets
website
bayesian networks
artificial neural networks
learning problems
science education