Improving Sample-Efficiency in Reinforcement Learning for Dialogue Systems by Using Trainable-Action-Mask.

Published in: ICASSP (2020)

Keyphrases