Login / Signup
An Improved Scheduling with Advantage Actor-Critic for Storm Workloads.
Gaoqiang Dong
Jia Wang
Mingjing Wang
Tingting Su
Published in:
CoRR (2023)
Keyphrases
</>
actor critic
reinforcement learning
optimal control
policy gradient
neuro fuzzy
temporal difference
function approximation
gradient method
approximate dynamic programming
policy iteration
average reward