Login / Signup
Optimal demand response based dynamic pricing strategy via Multi-Agent Federated Twin Delayed Deep Deterministic policy gradient algorithm.
Haining Ma
Huifeng Zhang
Ding Tian
Dong Yue
Gerhard P. Hancke
Published in:
Eng. Appl. Artif. Intell. (2024)
Keyphrases
</>
np hard
optimal solution
worst case
lower bound
computational complexity
multi agent
search space
dynamic programming
learning algorithm
lot sizing
multiple agents
average reward
policy gradient