C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm.
Róbert Busa-Fekete
Balázs Szörényi
Paul Weng
Weiwei Cheng
Eyke Hüllermeier
Published in:
Mach. Learn. (2014)
Keyphrases
</>
reinforcement learning
direct policy search
learning algorithm
optimal solution
dynamic programming
machine learning
mathematical model
mountain car
neural network
state space
support vector machine svm
transfer learning
markov decision processes
evolutionary computation
function approximation