Login / Signup
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
Ofir Nachum
Mohammad Norouzi
Kelvin Xu
Dale Schuurmans
Published in:
ICLR (Poster) (2018)
Keyphrases
</>
trust region
cost function
support vector machine
simulated annealing
linear programming
newton method