Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition.

Ryuichi Takanobu Runze Liang Minlie Huang

Published in: ACL (2020)

Keyphrases

reinforcement learning
multi agent
learning process
learning systems
cooperative
learning algorithm
learning tasks
supervised learning
online learning
learning experience
autonomous agents
learning problems
inductive inference
action selection