Login / Signup
Composing Entropic Policies using Divergence Correction.
Jonathan J. Hunt
André Barreto
Timothy P. Lillicrap
Nicolas Heess
Published in:
ICML (2019)
Keyphrases
</>
divergence measure
kullback leibler
optimal policy
mutual information
database
markov decision process
neural network
databases
website
clustering algorithm
search algorithm
search engine
markov chain
information theory
role based access control
stochastic approximation
control policies
revenue management