CARL: Conditional-value-at-risk Adversarial Reinforcement Learning.

Mathieu Godbout Maxime Heuillet Sharath Chandra Rupali Bhati Audrey Durand

Published in: CoRR (2021)

Keyphrases

reinforcement learning
multi agent
function approximation
model free
state space
machine learning
learning algorithm
neural network
reinforcement learning algorithms
temporal difference
learning problems
markov decision processes
optimal policy
multi agent reinforcement learning
robotic control
supervised learning
learning process
control problems
continuous state
direct policy search
reinforcement learning methods
optimal control
text mining
multi agent systems
data mining