Air-to-ground shepherd problem: An action-delay reinforcement learning approach.

Jiangcheng Zhu, Chao Xu

Published in: ACC (2017)

Keyphrases

reinforcement learning
action selection
action space
reward shaping
partially observable domains
transition model
Markov decision processes
state action
reinforcement learning algorithms
temporal difference
function approximation
state space
robotic control
optimal policy
machine learning
temporal difference learning
reinforcement learning methods
learning problems
neural network
transfer learning
continuous state
learning process
policy search
spatio-temporal