Login / Signup

Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback.

Khanh NguyenHal Daumé IIIJordan L. Boyd-Graber
Published in: EMNLP (2017)
Keyphrases