Policy Gradient Method For Robust Reinforcement Learning.
Yue WangShaofeng ZouPublished in: CoRR (2022)
Keyphrases
- gradient method
- actor critic
- policy gradient
- reinforcement learning
- optimal policy
- convergence rate
- convex formulation
- function approximation
- markov decision process
- negative matrix factorization
- step size
- machine learning
- optimization methods
- action selection
- wavelet coefficients
- multiresolution
- function approximators
- image segmentation
- learning algorithm