Self-Supervised Online Reward Shaping in Sparse-Reward Environments.
Farzan Memarian
Wonjoon Goo
Rudolf Lioutikov
Ufuk Topcu
Scott Niekum
Published in:
CoRR (2021)
Keyphrases
reward shaping
reinforcement learning
complex domains
reinforcement learning algorithms
online learning
domain knowledge
dynamic environments
Markov decision problems