Keyphrases
- inverse reinforcement learning
- receding horizon
- air traffic control
- optimal linear
- partially observable environments
- preference elicitation
- formation control
- reward function
- unmanned aerial vehicles
- temporal difference
- machine learning
- reinforcement learning
- dynamic programming
- mobile robot
- multi robot
- collision avoidance