Constrained Upper Confidence Reinforcement Learning.

Liyuan Zheng Lillian J. Ratliff

Published in: L4DC (2020)

Keyphrases

reinforcement learning
reactive planning
state space
function approximation
machine learning
multi agent
temporal difference
reinforcement learning algorithms
genetic algorithm
learning algorithm
production system
model free
control structure
temporal difference learning