Policy gradient assisted MAP-Elites.

Olle Nilsson Antoine Cully

Published in: GECCO (2021)

Keyphrases

policy gradient
parametric optimization
actor critic
reinforcement learning
function approximation
gradient method
optimal control
reinforcement learning algorithms
approximation methods
partially observable markov decision processes
model free reinforcement learning
variance reduction
average reward
neural network
feature maps
radial basis function
machine learning