Login / Signup
Efficient Offline Reinforcement Learning: The Critic is Critical.
Adam Jelley
Trevor McInroe
Sam Devlin
Amos J. Storkey
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
function approximation
temporal difference
lightweight
optimal control
reinforcement learning algorithms
learning algorithm
computationally expensive
real time
artificial intelligence
decision making
monte carlo
actor critic