Automatic shaping and decomposition of reward functions.

Bhaskara Marthi

Published in: ICML (2007)

Keyphrases

reward function
reinforcement learning
state space
transition probabilities
inverse reinforcement learning
optimal policy
state variables