• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling.

Yunfan LiYiran WangYu ChengLin Yang
Published in: CoRR (2023)
Keyphrases
  • policy gradient
  • variance reduction
  • function approximation
  • multi agent
  • action selection
  • parametric optimization