Publication: The exploration/exploitation trade-off in Reinforcement Learning for dialogue management.