Exploring More When It Needs in Deep Reinforcement Learning.
Youtian GuoQi GaoPublished in: CoRR (2021)
Keyphrases
- reinforcement learning
- reinforcement learning algorithms
- function approximation
- robotic control
- multi agent
- state space
- machine learning
- learning agents
- markov decision processes
- belief nets
- policy search
- multi agent reinforcement learning
- stochastic approximation
- reinforcement learning methods
- temporal difference
- learning capabilities
- data sets
- search engine
- learning agent
- deep learning
- dynamic programming
- information systems
- action selection
- model free
- image sequences
- bayesian networks
- optimal policy
- learning process