Strategy Generation Based on Reinforcement Learning with Deep Deterministic Policy Gradient for UCAV.

Yunhong Ma Shuyao Bai Yifei Zhao Chao Song Jie Yang

Published in: ICARCV (2020)

Keyphrases

policy gradient
reinforcement learning
actor critic
policy search
function approximation
reinforcement learning algorithms
optimal control
learning algorithm
model free reinforcement learning
state space
policy gradient methods
gradient method
reinforcement learning methods
markov decision processes
function approximators
average reward
single agent
approximation methods
variance reduction
temporal difference learning
multi agent
machine learning
partially observable markov decision processes
control system