Model-free attitude synchronization for multiple heterogeneous quadrotors via reinforcement learning.
Wanbing Zhao, Hao Liu, Bohui Wang
Published in: Int. J. Intell. Syst. (2021)
Keyphrases
- model free
- reinforcement learning
- reinforcement learning algorithms
- function approximation
- temporal difference
- reinforcement learning methods
- state space
- policy iteration
- policy evaluation
- rl algorithms
- learning process
- learning algorithm
- temporal difference learning
- learning tasks
- optimal control
- Markov decision processes
- optimal policy
- supervised learning
- feature vectors
- multi agent
- e learning