Login / Signup
Quantum Policy Gradient Algorithm with Optimized Action Decoding.
Nico Meyer
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Michael J. Hartmann
Published in:
ICML (2023)
Keyphrases
</>
learning algorithm
dynamic programming
search space
cost function
np hard
worst case
objective function
monte carlo
policy gradient
optimal solution
computational complexity
mathematical model
path planning
convergence rate
gradient ascent