Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback.

Tom Bewley, Jonathan Lawry, Arthur Richards

Published in: CoRR (2023)

Keyphrases

reinforcement learning
learning algorithm
learning process
prior knowledge
accurate models
learning models
language acquisition
computational models
learning tasks
learning problems
human experts
online learning
cognitive models
previously learned
supervised learning
state abstraction
human decision making
reinforcement learning methods
temporal difference learning
active exploration
structured prediction
cognitive model
qualitative models
reward function
temporal difference
action selection
hidden variables
learning systems
optimal policy