Login / Signup

Automatic shaping and decomposition of reward functions.

Bhaskara Marthi
Published in: ICML (2007)
Keyphrases
  • reward function
  • reinforcement learning
  • state space
  • transition probabilities
  • inverse reinforcement learning
  • optimal policy
  • state variables