Statistical Rejection Sampling Improves Preference Optimization.
Tianqi Liu, Yao Zhao, Rishabh Joshi, Misha Khalman, Mohammad Saleh, Peter J. Liu, Jialu Liu
Published in: ICLR (2024)
Keyphrases
- statistical analysis
- optimization algorithm
- hypothesis testing
- optimization process
- statistical models
- optimization method
- individual preferences
- linear programming
- constrained optimization
- database
- information-theoretic
- statistical inference
- confidence intervals
- random sampling
- global optimization
- statistical methods
- Monte Carlo
- sample size
- data-driven
- optimization problems
- multi-objective
- data sets