Uniformly Conservative Exploration in Reinforcement Learning.
Wanqiao XuYecheng Jason MaKan XuHamsa BastaniOsbert BastaniPublished in: AISTATS (2023)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- action selection
- exploration exploitation
- model based reinforcement learning
- markov decision processes
- function approximation
- exploration exploitation tradeoff
- autonomous learning
- state space
- model free
- temporal difference
- multi agent reinforcement learning
- multi agent
- information visualization
- learning problems
- relational reinforcement learning
- learning process
- robotic control
- decision making