Login / Signup
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm.
Johannes Fürnkranz
Eyke Hüllermeier
Weiwei Cheng
Sang-Hyeun Park
Published in:
Mach. Learn. (2012)
Keyphrases
</>
reinforcement learning
markov decision processes
optimal policy