Login / Signup
Automatic shaping and decomposition of reward functions.
Bhaskara Marthi
Published in:
ICML (2007)
Keyphrases
</>
reward function
reinforcement learning
state space
transition probabilities
inverse reinforcement learning
optimal policy
state variables