Asynchronous, Option-Based Multi-Agent Policy Gradient: A Conditional Reasoning Approach.
Xubo LyuAmin Banitalebi-DehkordiMo ChenYong ZhangPublished in: IROS (2023)
Keyphrases
- policy gradient
- multi agent
- single agent
- reinforcement learning
- cooperative
- actor critic
- function approximation
- multiple agents
- knowledge base
- model free reinforcement learning
- partially observable markov decision processes
- gradient method
- optimal control
- dynamic environments
- state space
- multi agent systems
- reinforcement learning algorithms
- variance reduction
- optimization methods
- real valued
- stochastic games
- neural network