Sign in

Hindsight Balanced Reward Shaping.

Mengxuan ShaoFeng JiangShaohui LiuKun HanDebin Zhao
Published in: ICONIP (5) (2022)
Keyphrases
  • reward shaping
  • reinforcement learning
  • complex domains
  • reinforcement learning algorithms
  • state space
  • function approximation
  • markov decision problems
  • machine learning
  • training data
  • optimal policy
  • action selection