Training language models to follow instructions with human feedback.
Long OuyangJeff WuXu JiangDiogo AlmeidaCarroll L. WainwrightPamela MishkinChong ZhangSandhini AgarwalKatarina SlamaAlex RayJohn SchulmanJacob HiltonFraser KeltonLuke MillerMaddie SimensAmanda AskellPeter WelinderPaul F. ChristianoJan LeikeRyan LowePublished in: CoRR (2022)
Keyphrases
- language model
- language modeling
- motor skills
- n gram
- probabilistic model
- document retrieval
- speech recognition
- information retrieval
- language modelling
- test collection
- retrieval model
- smoothing methods
- query expansion
- statistical language models
- vector space model
- document ranking
- training set
- relevance model
- pseudo relevance feedback
- relevance feedback
- passage retrieval
- context sensitive
- document length
- word error rate
- language models for information retrieval
- translation model
- user feedback
- error rate
- okapi bm
- language modeling framework
- language model for information retrieval