Strategy Generation Based on Reinforcement Learning with Deep Deterministic Policy Gradient for UCAV.
Yunhong MaShuyao BaiYifei ZhaoChao SongJie YangPublished in: ICARCV (2020)
Keyphrases
- policy gradient
- reinforcement learning
- actor critic
- policy search
- function approximation
- reinforcement learning algorithms
- optimal control
- learning algorithm
- model free reinforcement learning
- state space
- policy gradient methods
- gradient method
- reinforcement learning methods
- markov decision processes
- function approximators
- average reward
- single agent
- approximation methods
- variance reduction
- temporal difference learning
- multi agent
- machine learning
- partially observable markov decision processes
- control system