Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL.
Yang Yue
Rui Lu
Bingyi Kang
Shiji Song
Gao Huang
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
real time
function approximation
information retrieval
multi agent
data mining
machine learning
learning algorithm
information systems
decision making
learning process
mutual information
transfer learning