An optimal POMDP-based anti-jamming policy for cognitive radar.
Xiaofeng Jiang, Feng Zhou, Jian Yang, Hongsheng Xi
Published in: CASE (2017)
Keyphrases
- optimal policy
- radar signal
- Markov decision process
- average reward
- partially observable Markov decision processes
- Markov decision processes
- asymptotically optimal
- optimal solution
- partially observable
- model-free reinforcement learning
- partially observable Markov decision process
- control policy
- average cost
- learning algorithm
- finite state
- signal processing
- policy evaluation
- state space
- allocation policy
- dynamic programming
- expected cost
- reward function
- information processing
- least squares
- policy search
- multi-agent