Login / Signup
Soft policy optimization using dual-track advantage estimator.
Yubo Huang
Xuechun Wang
Luobao Zou
Zhiwei Zhuang
Weidong Zhang
Published in:
CoRR (2020)
Keyphrases
</>
optimization algorithm
global optimization
maximum likelihood
least squares
optimal policy
optimization method
constrained optimization
discrete optimization
neural network
image restoration
kalman filter
confidence intervals
primal dual
dual formulation