Action Candidate Driven Clipped Double Q-Learning for Discrete and Continuous Action Tasks.
Haobo Jiang, Guangyu Li, Jin Xie, Jian Yang
Published in: IEEE Trans. Neural Networks Learn. Syst. (2024)
Keyphrases
- continuous action
- policy search
- continuous state and action spaces
- reinforcement learning
- continuous state
- action space
- state space
- reinforcement learning algorithms
- partially observable Markov decision processes
- function approximation
- cooperative
- learning algorithm
- dynamic programming
- optimal policy
- action selection
- multi agent
- model free