Online Reinforcement Learning in Stochastic Games.
Chen-Yu WeiYi-Te HongChi-Jen LuPublished in: CoRR (2017)
Keyphrases
- stochastic games
- reinforcement learning
- reinforcement learning algorithms
- multi agent reinforcement learning
- markov decision processes
- multiagent reinforcement learning
- average reward
- rl algorithms
- state action
- multi agent
- state space
- learning automata
- learning agent
- nash equilibria
- function approximation
- optimal policy
- cooperative
- nash equilibrium
- single agent
- repeated games
- model free
- supervised learning
- learning process
- learning algorithm
- reward function
- policy iteration
- temporal difference
- machine learning
- finite state
- least squares
- dynamic programming