Online Emission Policy Selection for Radar Antijamming Using Bandit-Optimized Policy Search.
Yuyuan Fang
Song Wei
Lei Zhang
Zhenhua Wu
Jianxin Wu
Published in:
IEEE Trans. Aerosp. Electron. Syst. (2024)
Keyphrases
policy search
reinforcement learning
continuous state
policy gradient
reinforcement learning algorithms
dynamic programming
partially observable Markov decision processes
Markov decision problems
neural network
Markov chain