Offline Quantum Reinforcement Learning in a Conservative Manner.
Zhihao ChengKaining ZhangLi ShenDacheng TaoPublished in: AAAI (2023)
Keyphrases
- reinforcement learning
- real time
- learning algorithm
- function approximation
- dynamic programming
- state space
- quantum computation
- reinforcement learning algorithms
- learning process
- machine learning
- temporal difference
- data sets
- temporal difference learning
- markov decision processes
- multi agent
- active learning
- multi agent systems
- objective function
- image sequences
- model free
- expert systems
- action space
- learning agents
- quantum mechanics
- autonomous learning
- robotic control