Login / Signup

Optimal demand response based dynamic pricing strategy via Multi-Agent Federated Twin Delayed Deep Deterministic policy gradient algorithm.

Haining MaHuifeng ZhangDing TianDong YueGerhard P. Hancke
Published in: Eng. Appl. Artif. Intell. (2024)
Keyphrases
  • np hard
  • optimal solution
  • worst case
  • lower bound
  • computational complexity
  • multi agent
  • search space
  • dynamic programming
  • learning algorithm
  • lot sizing
  • multiple agents
  • average reward
  • policy gradient