Login / Signup
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning.
Mingqi Yuan
Bo Li
Xin Jin
Wenjun Zeng
Published in:
ICML (2023)
Keyphrases
</>
reward shaping
reinforcement learning
reinforcement learning algorithms
complex domains
action selection
state space
function approximation
model free
markov decision problems
markov decision processes
decision making
decision makers
multi agent
markov chain
dynamic programming
learning process
machine learning