Towards Faster Reinforcement Learning of Quantum Circuit Optimisation: Exponential Reward Functions.
Ioana Moflic, Alexandru Paler
Published in: NANOARCH (2023)
Keyphrases
- reward function
- reinforcement learning
- reinforcement learning algorithms
- logic circuits
- Markov decision processes
- policy search
- state space
- optimal policy
- inverse reinforcement learning
- Markov decision process
- partially observable
- transition probabilities
- multiple agents
- transition model
- function approximation
- control policies
- machine learning
- state action
- quantum computing
- model free
- multi agent
- Markov chain
- data mining
- action selection
- learning algorithm
- Markov decision problems
- learning agent
- state variables
- average reward
- probability distribution
- generative model
- probabilistic model
- Bayesian networks
- Markov models
- initially unknown
- temporal difference