A Logarithmic Barrier Method For Proximal Policy Optimization.
Cheng Zeng, Hongming Zhang
Published in: CoRR (2018)
Keyphrases
- detection method
- optimization method
- preprocessing
- significant improvement
- support vector machine
- synthetic data
- computational complexity
- constrained optimization
- optimization algorithm
- dynamic programming
- semi-supervised
- optimization process
- experimental evaluation
- prior knowledge
- objective function
- optimization procedure
- clustering algorithm
- fully automatic
- optimization problems
- classification accuracy
- training set
- reinforcement learning