Action Guidance with MCTS for Deep Reinforcement Learning.
Bilal KartalPablo Hernandez-LealMatthew E. TaylorPublished in: CoRR (2019)
Keyphrases
- reinforcement learning
- action selection
- partially observable domains
- action space
- reward shaping
- state space
- function approximation
- state action
- learning algorithm
- transition model
- monte carlo tree search
- temporal difference
- deep learning
- genetic algorithm
- robotic control
- reinforcement learning methods
- partially observable
- model free
- optimal policy
- multi agent
- learning classifier systems
- markov decision processes
- sensory inputs
- transfer learning
- agent learns
- markov chain
- machine learning
- fitted q iteration