Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization.
Rajkumar RamamurthyPrithviraj AmmanabroluKianté BrantleyJack HesselRafet SifaChristian BauckhageHannaneh HajishirziYejin ChoiPublished in: ICLR (2023)
Keyphrases
- building blocks
- natural language processing
- natural language
- reinforcement learning
- optimal policy
- machine learning
- language processing
- policy search
- semantic analysis
- information extraction
- knowledge representation
- optimization algorithm
- text mining
- natural language generation
- linguistic analysis
- text processing
- action selection
- natural language understanding
- question answering
- machine translation
- wordnet
- artificial intelligence
- named entity recognition
- dialogue system
- computational linguistics
- policy iteration
- state space
- linguistic knowledge
- control policies
- state action
- policy evaluation
- partially observable domains
- action space
- partially observable
- reinforcement learning algorithms
- function approximation
- word sense disambiguation
- markov decision processes
- optimization problems
- markov decision process
- partially observable markov decision processes
- model free
- policy gradient
- state and action spaces
- databases