Quantum Policy Gradient Algorithm with Optimized Action Decoding.
Nico Meyer
Daniel D. Scherer
Axel Plinge
Christopher Mutschler
Michael J. Hartmann
Published in:
CoRR (2022)
Keyphrases
learning algorithm
computational complexity
cost function
NP-hard
dynamic programming
simulated annealing
policy gradient
worst case
search space
Monte Carlo
convergence rate
machine learning
objective function
path planning
optimization method
gradient ascent