Sign in

Offline-Online Actor-Critic.

Xuesong WangDiyuan HouLongyang HuangYuhu Cheng
Published in: IEEE Trans. Artif. Intell. (2024)
Keyphrases
  • actor critic
  • real time
  • reinforcement learning
  • neural network
  • function approximation
  • machine learning
  • optimal control
  • average reward
  • approximate dynamic programming
  • multi agent
  • policy gradient