Status-quo policy gradient in Multi-Agent Reinforcement Learning.
Pinkesh BadjatiyaMausoom SarkarNikaash PuriJayakumar SubramanianAbhishek SinhaSiddharth SinghBalaji KrishnamurthyPublished in: CoRR (2021)
Keyphrases
- status quo
- multi agent reinforcement learning
- policy gradient
- reinforcement learning
- multi agent
- multi agent learning
- single agent
- function approximation
- reinforcement learning algorithms
- gradient method
- stochastic games
- optimal control
- learning agent
- state space
- multi agent systems
- markov decision processes
- learning automata
- average reward
- neural network
- learning process
- variance reduction
- temporal difference
- dynamic programming
- learning capabilities
- transfer learning