Lazy-MDPs: Towards Interpretable RL by Learning When to Act.

Alexis Jacq Johan Ferret Olivier Pietquin Matthieu Geist

Published in: AAMAS (2022)

Keyphrases

reinforcement learning methods
reinforcement learning
reinforcement learning algorithms
markov decision processes
state space
model free
supervised learning
learning algorithm
machine learning
multi agent
learning process
optimal policy
learning tasks
decision theoretic
action selection
learning problems