Multi-UAV Cooperative Short-Range Combat via Attention-Based Reinforcement Learning using Individual Reward Shaping.
Tianle ZhangTenghai QiuZhen LiuZhiqiang PuJianqiang YiJinying ZhuRuiguang HuPublished in: IROS (2022)
Keyphrases
- reward shaping
- short range
- reinforcement learning
- long range
- cooperative
- complex domains
- wireless communication
- reinforcement learning algorithms
- multi agent
- function approximation
- markov decision processes
- state space
- markov decision problems
- machine learning
- path planning
- optimal policy
- model free
- dynamic environments
- dynamic programming
- wireless sensor networks
- multi agent systems
- learning algorithm