Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning.

Wen Sun J. Andrew Bagnell Byron Boots

Published in: CoRR (2018)

Keyphrases

policy search
reinforcement learning
imitation learning
reinforcement learning algorithms
continuous state
reinforcement learning methods
function approximation
state space
learning algorithm
multi agent
machine learning
control problems
model free
dynamic programming
learning problems
partially observable markov decision processes
maximum margin
markov decision processes
partially observable
markov decision problems
optimal control
policy iteration
reward function
temporal difference
active learning
learning tasks