Offline RL for Natural Language Generation with Implicit Language Q Learning.
Charlie Snell, Ilya Kostrikov, Yi Su, Sherry Yang, Sergey Levine
Published in: ICLR (2023)
Keyphrases
- knowledge representation
- natural language generation
- natural language
- reinforcement learning
- natural language processing
- word order
- function approximation
- English text
- text generation
- dialog systems
- reinforcement learning algorithms
- optimal policy
- dialogue system
- state space
- model free
- multi agent
- machine learning
- cooperative
- dialogue management
- learning algorithm
- action selection
- machine translation
- aggregated search
- temporal difference learning
- reinforcement learning methods
- multi agent reinforcement learning
- target language
- Markov decision process
- learning classifier systems
- information extraction
- temporal difference methods
- policy iteration
- learning agent
- Markov decision processes
- expert systems