Entropy annealing for policy mirror descent in continuous time and space.

Deven Sethi, David Siska, Yufei Zhang

Published in: CoRR (2024)

Keyphrases

search space
real time
neural network
simulated annealing
low dimensional
space time
information theory
Markov processes
data sets
high dimensional
optimal policy
dynamical systems
information theoretic
vector space
space requirements