Lazy-MDPs: Towards Interpretable RL by Learning When to Act.
Alexis Jacq
Johan Ferret
Olivier Pietquin
Matthieu Geist
Published in:
AAMAS (2022)
Keyphrases
reinforcement learning methods
reinforcement learning
reinforcement learning algorithms
Markov decision processes
state space
model-free
supervised learning
learning algorithm
machine learning
multi-agent
learning process
optimal policy
learning tasks
decision-theoretic
action selection
learning problems