trlX: A Framework for Large Scale Reinforcement Learning from Human Feedback.
Alexander Havrilla, Maksym Zhuravinskyi, Duy Phung, Aman Tiwari, Jonathan Tow, Stella Biderman, Quentin Anthony, Louis Castricato
Published in: EMNLP (2023)
Keyphrases
- reinforcement learning
- theoretical framework
- main contribution
- information systems
- small scale
- function approximation
- artificial intelligence
- learning algorithm
- computer vision
- information retrieval
- multi agent
- real life
- probabilistic model
- semi supervised
- machine learning
- computational model
- data mining
- neural network
- data sets