Implicit Distributional Reinforcement Learning.
Yuguang YueZhendong WangMingyuan ZhouPublished in: NeurIPS (2020)
Keyphrases
- reinforcement learning
- function approximation
- state space
- reinforcement learning algorithms
- co occurrence
- learning algorithm
- temporal difference
- markov decision processes
- multi agent
- robotic control
- relational reinforcement learning
- multi agent reinforcement learning
- optimal policy
- dynamic programming
- information retrieval
- machine learning
- temporal difference learning
- model free
- learning classifier systems
- optimal control
- action selection
- artificial intelligence
- supervised learning
- partially observable
- control problems
- search engine
- objective function
- policy search
- databases
- perceptual aliasing
- real time