TD(0)-Replay: An Efficient Model-Free Planning with full Replay.

Abdulrahman Altahhan

Published in: IJCNN (2018)

Keyphrases

model free
temporal difference
reinforcement learning algorithms
reinforcement learning
function approximation
policy evaluation
policy iteration
temporal difference learning
planning problems
learning algorithm
genetic algorithm
state space
evaluation function
reinforcement learning methods
impedance control