Deep Reinforcement Learning with Adjustments.

Hamed Khorasgani, Haiyan Wang, Chetan Gupta, Susumu Serita

Published in: CoRR (2021)

Keyphrases

reinforcement learning
function approximation
state space
Markov decision processes
database
optimal policy
learning algorithm
stochastic approximation
control problems
temporal difference
model free
deep learning
dynamic programming
machine learning
Markov decision process
policy search
robot control
belief nets
reward function
Monte Carlo
dynamic environments
evolutionary algorithm
learning process
real time