CARL: Conditional-value-at-risk Adversarial Reinforcement Learning.
Mathieu GodboutMaxime HeuilletSharath ChandraRupali BhatiAudrey DurandPublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- multi agent
- function approximation
- model free
- state space
- machine learning
- learning algorithm
- neural network
- reinforcement learning algorithms
- temporal difference
- learning problems
- markov decision processes
- optimal policy
- multi agent reinforcement learning
- robotic control
- supervised learning
- learning process
- control problems
- continuous state
- direct policy search
- reinforcement learning methods
- optimal control
- text mining
- multi agent systems
- data mining