State Deviation Correction for Offline Reinforcement Learning
Hongchang Zhang
Jianzhun Shao
Yuhang Jiang
Shuncheng He
Guanwen Zhang
Xiangyang Ji
Published in:
AAAI (2022)
Keyphrases
reinforcement learning
state space
real time
genetic algorithm
multiscale
transition model
neural network
machine learning
information retrieval
learning environment
search algorithm
function approximation
markov decision process