Learning Continuous 3-DoF Air-to-Air Close-in Combat Strategy using Proximal Policy Optimization.
Luntong Li
Zhiming Zhou
Jiajun Chai
Zhen Liu
Yuanheng Zhu
Jianqiang Yi
Published in:
CoG (2022)
Keyphrases
learning algorithm
knowledge acquisition
optimization problems
learning problems
global optimization
computer vision
learning systems
mobile learning
learning tasks
training data
reinforcement learning
online learning
constrained optimization
learning mechanism
continuous domains
function approximators