Reward-Free Exploration for Reinforcement Learning.
Chi Jin, Akshay Krishnamurthy, Max Simchowitz, Tiancheng Yu
Published in: ICML (2020)
Keyphrases
- reinforcement learning
- exploration strategy
- exploration exploitation
- action selection
- active exploration
- function approximation
- reinforcement learning algorithms
- learning algorithm
- model based reinforcement learning
- eligibility traces
- reward function
- state space
- Markov decision processes
- multi agent
- optimal policy
- average reward
- multi agent reinforcement learning
- balancing exploration and exploitation
- machine learning
- partially observable environments
- learning process
- model free
- autonomous learning
- control policy
- learning capabilities
- temporal difference
- policy search
- supervised learning
- state action