Directed Policy Search Using Relevance Vector Machines.
Ioannis Rexakis
Michail G. Lagoudakis
Published in:
ICTAI (2012)
Keyphrases
relevance vector machines
policy search
reinforcement learning
dynamic programming
continuous state
reinforcement learning algorithms
partially observable Markov decision processes
reward function
learning algorithm
supervised learning
model selection
sample complexity
policy gradient