Login / Signup
Cascade Reward Sampling for Efficient Decoding-Time Alignment.
Bolian Li
Yifan Wang
Ananth Grama
Ruqi Zhang
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
genetic algorithm
cost effective
database
data mining
website
probabilistic model
random sampling
image alignment