Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints.
Donghao LiRuiquan HuangCong ShenJing YangPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- active exploration
- action selection
- function approximation
- real time
- state space
- pairwise
- constrained optimization
- learning algorithm
- autonomous learning
- exploration strategy
- learning process
- transfer learning
- geometric constraints
- multi agent reinforcement learning
- database
- model based reinforcement learning