Login / Signup
Potential-based reward shaping for POMDPs.
Adam Eck
Leen-Kiat Soh
Sam Devlin
Daniel Kudenko
Published in:
AAMAS (2013)
Keyphrases
</>
reinforcement learning
reward shaping
dynamic programming
markov decision processes
knowledge base
multi agent
search algorithm
markov decision problems
continuous state