C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation.
Wenhong Zhu
Hongkun Hao
Rui Wang
Published in:
CoRR (2023)
Keyphrases
</>
open ended
text generation
natural language generation
learning outcomes
reinforcement learning
natural language
objective function
multiple choice
machine learning
metacognitive processes