PRACM: Predictive Rewards for Actor-Critic with Mixing Function in Multi-Agent Reinforcement Learning.
Sheng YuBo LiuWei ZhuShuhong LiuPublished in: KSEM (4) (2023)
Keyphrases
- multi agent reinforcement learning
- reinforcement learning
- actor critic
- policy gradient
- function approximation
- temporal difference
- reinforcement learning algorithms
- state space
- neuro fuzzy
- learning agents
- multi agent
- machine learning
- reward function
- function approximators
- control policy
- temporal difference learning