Corruption-robust exploration in episodic reinforcement learning.
Thodoris LykourisMax SimchowitzAlex SlivkinsWen SunPublished in: COLT (2021)
Keyphrases
- reinforcement learning
- learning algorithm
- active exploration
- multi agent
- computationally efficient
- action selection
- balancing exploration and exploitation
- machine learning
- model based reinforcement learning
- robotic control
- exploration strategy
- autonomous learning
- robust estimation
- parameter tuning
- optimal policy
- state space
- website
- computer vision