Keyphrases
- reinforcement learning
- game playing
- temporal difference learning
- imperfect information
- board game
- games played
- temporal difference
- evaluation function
- function approximation
- state space
- markov decision processes
- artificial intelligence
- reinforcement learning algorithms
- learning algorithm
- optimal policy
- game tree search
- action space
- machine learning
- game tree
- deep learning
- learning agents
- model free
- supervised learning
- dynamic programming
- multi agent
- action selection
- human players
- multi agent reinforcement learning
- transition model
- computer chess
- minimax search