Direct and indirect reinforcement learning.

Yang Guan Shengbo Eben Li Jingliang Duan Jie Li Yangang Ren Qi Sun Bo Cheng

Published in: Int. J. Intell. Syst. (2021)

Keyphrases

reinforcement learning
model free
temporal difference
reinforcement learning algorithms
optimal policy
function approximation
multi agent reinforcement learning
temporal difference learning
state space
optimal control
policy search
databases
control problems
action selection
markov decision processes
learning process
case study
information systems
real world
neural network