Discovering Reinforcement Learning Algorithms.
Junhyuk Oh, Matteo Hessel, Wojciech M. Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver
Published in: NeurIPS (2020)
Keyphrases
- reinforcement learning algorithms
- reinforcement learning
- state space
- model free
- Markov decision processes
- reinforcement learning problems
- eligibility traces
- temporal difference
- reinforcement learning methods
- learning algorithm
- reward function
- function approximation
- partially observable environments
- policy search
- stochastic games
- reward shaping
- dynamic environments
- higher order
- hidden Markov models
- training data