Entropy annealing for policy mirror descent in continuous time and space.
Deven Sethi
David Siska
Yufei Zhang
Published in:
CoRR (2024)
Keyphrases
search space
real time
neural network
simulated annealing
low dimensional
space time
information theory
markov processes
data sets
high dimensional
optimal policy
dynamical systems
information theoretic
vector space
space requirements