Towards better generalization in quadrotor landing using deep reinforcement learning.
Jiawei WangTeng WangZichen HeWenzhe CaiChangyin SunPublished in: Appl. Intell. (2023)
Keyphrases
- reinforcement learning
- function approximation
- learning algorithm
- information systems
- multi agent
- markov decision processes
- reinforcement learning algorithms
- data sets
- information retrieval
- case study
- dynamic programming
- state space
- temporal difference
- temporal difference learning
- matlab simulink
- multi agent reinforcement learning