Login / Signup
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning.
Dilip Arumugam
Benjamin Van Roy
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
mathematical model
computational model
objective function
learning algorithm
high level
probabilistic model
probability distribution
management system
parameter estimation
experimental data
dynamic programming
em algorithm
statistical model
formal model
model free