Bandits with partially observable confounded data.
Guy Tennenholtz
Uri Shalit
Shie Mannor
Yonathan Efroni
Published in:
UAI (2021)
Keyphrases
prior knowledge
state space
data structure
search algorithm
general purpose
partially observable
learning algorithm
Bayesian networks
reinforcement learning
computational complexity
probability distribution
Markov decision processes