Preference-based reinforcement learning: a formal framework and a policy iteration algorithm.

Published in: Mach. Learn. (2012)

Keyphrases