Login / Signup

Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling.

Yunfan LiYiran WangYu ChengLin Yang
Published in: CoRR (2023)
Keyphrases
  • policy gradient
  • variance reduction
  • function approximation
  • multi agent
  • action selection
  • parametric optimization