BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback.

Published in: CoRR (2024)

Keyphrases