Reinforcement Learning Guided by Provable Normative Compliance.

Emery A. Neufeld

Published in: ICAART (3) (2022)

Keyphrases

reinforcement learning
function approximation
multi agent
state space
multi agent reinforcement learning
learning algorithm
multi agent systems
model free
direct policy search
temporal difference learning
control problems
partially observable
reinforcement learning algorithms
temporal difference
learning problems
transfer learning
machine learning
markov decision processes
dynamic programming
optimal control
action selection
optimal policy
learning process
case study
reinforcement learning methods
stochastic approximation
policy search
neural network