Login / Signup
Regret Bounds for Information-Directed Reinforcement Learning.
Botao Hao
Tor Lattimore
Published in:
NeurIPS (2022)
Keyphrases
</>
reinforcement learning
machine learning
learning algorithm
learning process
multi class
mutual information