Deep Reinforcement Learning for Six Degree-of-Freedom Planetary Powered Descent and Landing.

Brian Gaudet Richard Linares Roberto Furfaro

Published in: CoRR (2018)

Keyphrases

reinforcement learning
function approximation
multi agent
learning process
optimal policy
learning algorithm
stochastic approximation
state space
temporal difference learning
multi agent reinforcement learning
model free
optimal control
learning problems
markov decision processes
supervised learning
decision making
computer vision