Self-Exploring Language Models: Active Preference Elicitation for Online Alignment.
Shenao ZhangDonghan YuHiteshi SharmaZiyi YangShuohang WangHany HassanZhaoran WangPublished in: CoRR (2024)
Keyphrases
- language model
- preference elicitation
- language modeling
- n gram
- probabilistic model
- utility function
- document retrieval
- language modelling
- retrieval model
- statistical language models
- information retrieval
- query expansion
- language models for information retrieval
- test collection
- multi criteria
- document ranking
- decision theory
- smoothing methods
- machine learning
- cross lingual
- relevance model