Login / Signup
First Results from Using Temporal Difference Learning in Shogi.
Donald F. Beal
Martin C. Smith
Published in:
Computers and Games (1998)
Keyphrases
</>
temporal difference learning
game playing
fixed point
function approximation
evaluation function
temporal difference
reinforcement learning
approximate value iteration
markov decision process
video games
reinforcement learning algorithms
decision making
least squares
monte carlo
markov chain