Hindsight Learning for MDPs with Exogenous Inputs.
Sean R. Sinclair
Felipe Frujeri
Ching-An Cheng
Adith Swaminathan
Published in:
CoRR (2022)
Keyphrases
learning algorithm
reinforcement learning
learning process
prior knowledge
neural network
online learning
learning systems
data sets
least squares
supervised learning
unsupervised learning
action selection