Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control.
Xiang Fan
Yiwei Lyu
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
Published in:
ACL (Findings) (2023)
Keyphrases
language model
reinforcement learning
language modeling
n gram
information retrieval
active learning
learning algorithm
multimedia
hidden markov models
speech recognition
language modelling
probabilistic model
test collection