Model-Ensemble Trust-Region Policy Optimization.
Thanard Kurutach
Ignasi Clavera
Yan Duan
Aviv Tamar
Pieter Abbeel
Published in:
CoRR (2018)
Keyphrases
objective function
neural network
probabilistic model
optimization procedure
learning algorithm
training data
optimization algorithm
metaheuristic
optimization method
global convergence