Efficient Dialogue Complementary Policy Learning via Deep Q-network Policy and Episodic Memory Policy.

Yangyang Zhao, Zhenyu Wang, Changxi Zhu, Shihan Wang
Published in: EMNLP (1) (2021)
Keyphrases
  • learning algorithm
  • information retrieval
  • reinforcement learning
  • learning process
  • search space
  • supervised learning
  • knowledge acquisition