Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization.
Eivind Bøhn, Erlend M. Coates, Signe Moe, Tor Arne Johansen
Published in: CoRR (2019)
Keyphrases
- reinforcement learning
- control policy
- optimal policy
- control policies
- action selection
- unmanned aerial vehicles
- control problems
- optimal control
- optimization algorithm
- control system
- policy search
- learning algorithm
- action space
- Markov decision process
- robot control
- approximate dynamic programming
- reinforcement learning algorithms
- adaptive control
- infinite horizon
- control method
- state space
- multi-agent
- neural network
- partially observable
- model-free
- policy iteration
- control algorithm
- function approximation
- optimization method
- reliability analysis
- dynamic environments
- policy evaluation
- state and action spaces