Single-Timescale Stochastic Nonconvex-Concave Optimization for Smooth Nonlinear TD Learning.
Shuang QiuZhuoran YangXiaohan WeiJieping YeZhaoran WangPublished in: CoRR (2020)
Keyphrases
- td learning
- optimization problems
- global optimization
- objective function
- temporal difference
- nonlinear programming
- piecewise linear
- monte carlo
- constrained optimization
- optimization algorithm
- evaluation function
- convex optimization
- function approximation
- fixed point
- machine learning
- multi objective
- evolutionary algorithm
- artificial neural networks
- decision making