Login / Signup
Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm.
Róbert Busa-Fekete
Balázs Szörényi
Paul Weng
Weiwei Cheng
Eyke Hüllermeier
Published in:
Mach. Learn. (2014)
Keyphrases
</>
reinforcement learning
direct policy search
learning algorithm
optimal solution
dynamic programming
machine learning
mathematical model
mountain car
neural network
state space
support vector machine svm
transfer learning
markov decision processes
evolutionary computation
function approximation