Keyphrases
- stochastic games
- markov decision processes
- average reward
- infinite horizon
- optimal policy
- long run
- games with incomplete information
- multiagent reinforcement learning
- state space
- finite state
- dynamic programming
- reinforcement learning
- finite horizon
- average cost
- optimal control
- learning automata
- multi agent
- nash equilibria
- policy iteration
- reinforcement learning algorithms
- nash equilibrium
- markov decision process
- repeated games
- partially observable
- upper bound
- search space