Optimal Demand Response Using Device Based Reinforcement Learning.

Zheng Wen Daniel O'Neill Hamid Reza Maei

Published in: CoRR (2014)

Keyphrases

reinforcement learning
optimal control
dynamic programming
state space
reinforcement learning algorithms
genetic algorithm
worst case
machine learning
optimal solution
profit maximizing
budget constraints
inventory systems
holding cost
average reward
lead time
function approximation
markov decision processes
optimal policy
markov chain
supply chain
multi agent