Login / Signup

Preference-based reinforcement learning: a formal framework and a policy iteration algorithm.

Johannes FürnkranzEyke HüllermeierWeiwei ChengSang-Hyeun Park
Published in: Mach. Learn. (2012)
Keyphrases
  • reinforcement learning
  • markov decision processes
  • optimal policy