Sign in

Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms.

Yurou ChenFengyi ZhangZhiyong Liu
Published in: Neural Networks (2024)
Keyphrases
  • trade off
  • bias variance
  • recursive least squares
  • learning algorithm
  • policy iteration
  • least squares
  • optimal control
  • markov decision processes
  • optimization methods
  • adaptive control