Offline Quantum Reinforcement Learning in a Conservative Manner.

Zhihao Cheng Kaining Zhang Li Shen Dacheng Tao

Published in: AAAI (2023)

Keyphrases

reinforcement learning
real time
learning algorithm
function approximation
dynamic programming
state space
quantum computation
reinforcement learning algorithms
learning process
machine learning
temporal difference
data sets
temporal difference learning
markov decision processes
multi agent
active learning
multi agent systems
objective function
image sequences
model free
expert systems
action space
learning agents
quantum mechanics
autonomous learning
robotic control