Login / Signup
Double Deep Q Network with Huber Reward Function for Cart-Pole Balancing Problem.
Mishra Shaili
Anuja Arora
Published in:
Int. J. Perform. Eng. (2022)
Keyphrases
</>
reward function
markov decision processes
reinforcement learning
semi supervised
network structure
inverse reinforcement learning
data mining
optimal policy
complex networks
multiple agents
reinforcement learning algorithms