A Controllable Lifestyle Simulator for Use in Deep Reinforcement Learning Algorithms.

Libio Gonçalves Braz Allmin Pradhap Singh Susaiyah

Published in: ICASSP (2023)

Keyphrases

reinforcement learning algorithms
reinforcement learning
state space
markov decision processes
model free
reinforcement learning problems
learning algorithm
function approximation
reinforcement learning methods
temporal difference
eligibility traces
reward function
partially observable environments
policy search
tabula rasa
stochastic games
sufficient conditions
reward shaping
multi agent
bayesian networks