Login / Signup
Constrained Upper Confidence Reinforcement Learning.
Liyuan Zheng
Lillian J. Ratliff
Published in:
L4DC (2020)
Keyphrases
</>
reinforcement learning
reactive planning
state space
function approximation
machine learning
multi agent
temporal difference
reinforcement learning algorithms
genetic algorithm
learning algorithm
production system
model free
control structure
temporal difference learning