A Policy Iteration Algorithm for Learning from Preference-Based Feedback.

Christian Wirth, Johannes Fürnkranz
Published in: IDA (2013)
Keyphrases
  • learning algorithm
  • active learning
  • supervised learning
  • learning tasks
  • reinforcement learning