Proximal Policy Optimization Algorithms.

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov

Published in: CoRR (2017)

Keyphrases

optimization problems
learning algorithm
optimization methods
computationally efficient
times faster
data structure
discrete optimization
combinatorial optimization
inverse reinforcement learning
stochastic search
recently developed
benchmark datasets
theoretical analysis
worst case
computational cost
significant improvement
lower bound
computational complexity
image segmentation