Soft policy optimization using dual-track advantage estimator.
Yubo Huang
Xuechun Wang
Luobao Zou
Zhiwei Zhuang
Weidong Zhang
Published in:
ICDM (2020)
Keyphrases
optimization algorithm
global optimization
optimization problems
least squares
optimal policy
optimization method
optimization process
neural network
constrained optimization
optimization model
data sets
cost function
expected cost
policy makers