Constrained Upper Confidence Reinforcement Learning.
Liyuan ZhengLillian J. RatliffPublished in: CoRR (2020)
Keyphrases
- reinforcement learning
- reactive planning
- state space
- function approximation
- optimal policy
- object recognition
- multi agent
- machine learning
- optimal control
- model free
- temporal difference
- belief state
- reinforcement learning algorithms
- markov decision processes
- temporal difference learning
- robotic control
- learning algorithm