Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value At Risk.

Dohyeong Kim, Songhwai Oh
Published in: IEEE Robotics and Automation Letters (2022)
Keyphrases
  • trust region
  • reinforcement learning
  • newton method
  • machine learning
  • genetic algorithm
  • learning algorithm
  • multi view