Login / Signup
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning.
Xudong Yu
Chenjia Bai
Hongyi Guo
Changhong Wang
Zhen Wang
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
function approximation
real time
machine learning
learning algorithm
case study
wide variety
temporal difference
data sets
information systems
supervised learning
basis functions
transfer learning
action space