Login / Signup
Offline-Online Actor-Critic.
Xuesong Wang
Diyuan Hou
Longyang Huang
Yuhu Cheng
Published in:
IEEE Trans. Artif. Intell. (2024)
Keyphrases
</>
actor critic
real time
reinforcement learning
neural network
function approximation
machine learning
optimal control
average reward
approximate dynamic programming
multi agent
policy gradient