EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning.
Kinjal BasuKeerthiram MurugesanSubhajit ChaudhuryMurray CampbellKartik TalamadupulaTim KlingerPublished in: CoRR (2024)
Keyphrases
- reinforcement learning
- active exploration
- action selection
- exploration strategy
- function approximation
- knowledge base
- model based reinforcement learning
- exploration exploitation
- autonomous learning
- exploration exploitation tradeoff
- reasoning tasks
- optimal policy
- temporal difference
- automated reasoning
- keywords
- qualitative reasoning
- knowledge representation
- reasoning process
- legal reasoning
- balancing exploration and exploitation
- metadata
- multi agent
- machine learning
- multimedia
- textual features
- dynamic programming
- model based reasoning
- reasoning systems
- markov decision process