Login / Signup
Reward Gaming in Conditional Text Generation.
Richard Yuanzhe Pang
Vishakh Padmakumar
Thibault Sellam
Ankur P. Parikh
He He
Published in:
CoRR (2022)
Keyphrases
</>
text generation
natural language generation
natural language
reinforcement learning
theorem prover
virtual environment
random field model
bayesian networks
computer games
educational games
data sets
natural language processing
reward function
multi player