Policy Evaluation and Seeking for Multiagent Reinforcement Learning via Best Response.
Rui YanXiaoming DuanZongying ShiYisheng ZhongJason R. MardenFrancesco BulloPublished in: IEEE Trans. Autom. Control. (2022)
Keyphrases
- multiagent reinforcement learning
- policy evaluation
- reinforcement learning algorithms
- temporal difference
- reinforcement learning
- least squares
- stochastic games
- markov decision processes
- model free
- multiagent systems
- cooperative
- monte carlo
- multi agent
- policy iteration
- function approximation
- variance reduction
- semi parametric
- optimal policy
- finite state
- evaluation function
- state space
- solving problems
- decision problems
- nash equilibria
- average reward
- dynamic programming