Combining Evolution and Deep Reinforcement Learning for Policy Search: a Survey.
Olivier SigaudPublished in: CoRR (2022)
Keyphrases
- policy search
- reinforcement learning
- reinforcement learning algorithms
- continuous state
- dynamic programming
- state space
- function approximation
- continuous action
- policy gradient
- markov decision processes
- reward function
- machine learning
- markov decision problems
- robot navigation
- function approximators
- temporal difference
- transfer learning
- multi agent systems
- learning algorithm