On Many-Actions Policy Gradient.

Michal Nauman, Marek Cygan

Published in: ICML (2023)

Keyphrases

policy gradient
state action
parametric optimization
actor critic
reinforcement learning
optimal control
approximation methods
state transitions
function approximation
model free reinforcement learning
neural network
action space
stochastic games
reinforcement learning algorithms
variance reduction
evaluation function
np hard