Login / Signup
Entropic Policy Composition with Generalized Policy Improvement and Divergence Correction.
Jonathan J. Hunt
André Barreto
Timothy P. Lillicrap
Nicolas Heess
Published in:
CoRR (2018)
Keyphrases
</>
decision process
neural network
optimal policy
action selection
real time
decision trees
significant improvement
policy making