Action Guidance with MCTS for Deep Reinforcement Learning.

Bilal Kartal Pablo Hernandez-Leal Matthew E. Taylor

Published in: CoRR (2019)

Keyphrases

reinforcement learning
action selection
partially observable domains
action space
reward shaping
state space
function approximation
state action
learning algorithm
transition model
monte carlo tree search
temporal difference
deep learning
genetic algorithm
robotic control
reinforcement learning methods
partially observable
model free
optimal policy
multi agent
learning classifier systems
markov decision processes
sensory inputs
transfer learning
agent learns
markov chain
machine learning
fitted q iteration