Towards Deeper Deep Reinforcement Learning.
Johan BjorckCarla P. GomesKilian Q. WeinbergerPublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- function approximation
- state space
- machine learning
- control problems
- temporal difference
- dynamic programming
- markov decision processes
- optimal policy
- policy search
- temporal difference learning
- model free
- transfer learning
- perceptual aliasing
- real time
- multi agent
- information systems
- relational reinforcement learning
- active exploration
- belief nets
- transition model
- autonomous learning
- stochastic approximation
- learning problems
- probabilistic model
- hidden markov models
- artificial neural networks