Login / Signup
A Policy Iteration Algorithm for Learning from Preference-Based Feedback.
Christian Wirth
Johannes Fürnkranz
Published in:
IDA (2013)
Keyphrases
</>
learning algorithm
active learning
supervised learning
learning tasks
reinforcement learning