Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback.
Tom BewleyJonathan LawryArthur RichardsPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- learning algorithm
- learning process
- prior knowledge
- accurate models
- learning models
- language acquisition
- computational models
- learning tasks
- learning problems
- human experts
- online learning
- cognitive models
- previously learned
- supervised learning
- state abstraction
- human decision making
- reinforcement learning methods
- temporal difference learning
- active exploration
- structured prediction
- cognitive model
- qualitative models
- reward function
- temporal difference
- action selection
- hidden variables
- learning systems
- optimal policy