Login / Signup

Cascade Reward Sampling for Efficient Decoding-Time Alignment.

Bolian LiYifan WangAnanth GramaRuqi Zhang
Published in: CoRR (2024)
Keyphrases
  • reinforcement learning
  • genetic algorithm
  • cost effective
  • database
  • data mining
  • website
  • probabilistic model
  • random sampling
  • image alignment