P3O: Policy-on Policy-off Policy Optimization.
Rasool Fakoor
Pratik Chaudhari
Alexander J. Smola
Published in:
UAI (2019)
Keyphrases
database
policy making
optimization algorithm
optimal policy
databases
reinforcement learning
optimization problems
action selection
policy search