Training language models to follow instructions with human feedback.
Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, John Schulman, Jacob Hilton, Fraser Kelton, Luke Miller, Maddie Simens, Amanda Askell, Peter Welinder, Paul F. Christiano, Jan Leike, Ryan Lowe
Published in: NeurIPS (2022)
Keyphrases
- language model
- language modeling
- motor skills
- n gram
- document retrieval
- probabilistic model
- information retrieval
- language modelling
- query expansion
- speech recognition
- test collection
- context sensitive
- retrieval model
- document ranking
- statistical language models
- query terms
- smoothing methods
- training set
- word error rate
- search engine
- pseudo relevance feedback
- vector space model
- relevance model
- retrieval effectiveness
- ad hoc information retrieval
- statistical language modeling
- language models for information retrieval
- language model for information retrieval