Robust Reinforcement Learning via Adversarial training with Langevin Dynamics.
Parameswaran Kamalaruban
Yu-Ting Huang
Ya-Ping Hsieh
Paul Rolland
Cheng Shi
Volkan Cevher
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
multi agent
function approximation
machine learning
decision trees
training examples
dynamical systems
supervised learning
artificial neural networks
online learning
least squares
optimal policy
test set
case study
training process
parameter tuning
robust estimation
model free
action selection