Benchmarking reinforcement learning algorithms for demand response applications.

Brida V. Mbuwir, Carlo Manna, Fred Spiessens, Geert Deconinck

Published in: ISGT-Europe (2020)

Keyphrases

reinforcement learning algorithms
reinforcement learning
model free
state space
Markov decision processes
reinforcement learning problems
eligibility traces
temporal difference
learning algorithm
function approximation
partially observable environments
dynamic environments
reinforcement learning methods
reward function
policy search
tabula rasa
mobile robot
reward shaping
multiagent reinforcement learning
partially observable
dynamic programming
EM algorithm
particle filter
Markov chain
least squares