Action Candidate Based Clipped Double Q-learning for Discrete and Continuous Action Tasks.

Haobo Jiang, Jin Xie, Jian Yang

Published in: AAAI (2021)

Keyphrases

continuous action
policy search
reinforcement learning
continuous state and action spaces
continuous state
reinforcement learning algorithms
action space
state space
cooperative
multi-agent
action selection
partially observable Markov decision processes
learning algorithm
neural network
model-free
NP-hard
machine learning