Login / Signup
Hindsight Learning for MDPs with Exogenous Inputs.
Sean R. Sinclair
Felipe Vieira Frujeri
Ching-An Cheng
Luke Marshall
Hugo De Oliveira Barbalho
Jingling Li
Jennifer Neville
Ishai Menache
Adith Swaminathan
Published in:
ICML (2023)
Keyphrases
</>
reinforcement learning
learning algorithm
learning systems
learning process
markov decision processes
search space
state space
supervised learning
online learning
unsupervised learning
optimal policy
mobile learning
learning problems
inductive inference
multiple agents