A Controllable Lifestyle Simulator for Use in Deep Reinforcement Learning Algorithms.
Libio Gonçalves BrazAllmin Pradhap Singh SusaiyahPublished in: ICASSP (2023)
Keyphrases
- reinforcement learning algorithms
- reinforcement learning
- state space
- markov decision processes
- model free
- reinforcement learning problems
- learning algorithm
- function approximation
- reinforcement learning methods
- temporal difference
- eligibility traces
- reward function
- partially observable environments
- policy search
- tabula rasa
- stochastic games
- sufficient conditions
- reward shaping
- multi agent
- bayesian networks