Generative adversarial exploration for reinforcement learning.
Weijun HongMenghui ZhuMinghuan LiuWeinan ZhangMing ZhouYong YuPeng SunPublished in: DAI (2019)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- multi agent
- action selection
- model based reinforcement learning
- exploration exploitation
- function approximation
- exploration exploitation tradeoff
- autonomous learning
- markov decision processes
- data driven
- state space
- dynamic programming
- unsupervised learning
- reinforcement learning algorithms
- active learning
- partially observable
- temporal difference
- model free
- search strategies
- generative model
- optimal policy
- neural network
- interactive exploration
- multi agent reinforcement learning
- transfer learning
- data sets
- robotic control
- learning algorithm
- probabilistic model
- temporal difference learning
- discriminative learning