Login / Signup
Chaining Value Functions for Off-Policy Learning.
Simon Schmitt
John Shawe-Taylor
Hado van Hasselt
Published in:
CoRR (2022)
Keyphrases
</>
learning systems
learning process
knowledge acquisition
real world
reinforcement learning
evolutionary algorithm
probabilistic model
empirical studies
learning analytics
concept learning
databases
data mining
lower bound
artificial neural networks
learning mechanism