Policy Gradient with Kernel Quadrature.
Satoshi Hayakawa
Tetsuro Morimura
Published in:
Trans. Mach. Learn. Res. (2024)
Keyphrases
policy gradient
reinforcement learning
actor critic
kernel function
kernel methods
gradient method
function approximation
model free reinforcement learning
optimal control
support vector
reinforcement learning algorithms
feature space
dynamic programming
approximation methods
average reward
state space