Deep Reinforcement Learning for Morpion Solitaire.

Boris Doux Benjamin Négrevergne Tristan Cazenave

Published in: ACG (2021)

Keyphrases

reinforcement learning
integer programming
function approximation
reinforcement learning algorithms
optimal policy
temporal difference
model free
neural network
control problems
learning process
state space
robotic control
learning capabilities
markov decision processes
dynamic programming
artificial neural networks
learning problems
supervised learning
special case
website
function approximators
reinforcement learning methods
stochastic approximation
continuous state
multi agent reinforcement learning
policy search
e learning