Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks.
Haobo Jiang, Jin Xie, Jian Yang
Published in: AAAI (2021)
Keyphrases
- continuous action
- policy search
- reinforcement learning
- continuous state and action spaces
- continuous state
- reinforcement learning algorithms
- action space
- state space
- cooperative
- multi-agent
- action selection
- partially observable Markov decision processes
- learning algorithm
- neural network
- model-free
- NP-hard
- machine learning