Deep Reinforcement Learning for Six Degree-of-Freedom Planetary Powered Descent and Landing.
Brian GaudetRichard LinaresRoberto FurfaroPublished in: CoRR (2018)
Keyphrases
- reinforcement learning
- function approximation
- multi agent
- learning process
- optimal policy
- learning algorithm
- stochastic approximation
- state space
- temporal difference learning
- multi agent reinforcement learning
- model free
- optimal control
- learning problems
- markov decision processes
- supervised learning
- decision making
- computer vision