Implicit Distributional Reinforcement Learning.
Yuguang YueZhendong WangMingyuan ZhouPublished in: CoRR (2020)
Keyphrases
- reinforcement learning
- function approximation
- markov decision processes
- temporal difference
- co occurrence
- reinforcement learning algorithms
- machine learning
- multi agent reinforcement learning
- model free
- state space
- optimal policy
- direct policy search
- robotic control
- domain knowledge
- databases
- database
- evolutionary algorithm
- optimal control
- learning process
- decision making
- information systems
- partially observable
- learning agents
- temporal difference learning
- artificial intelligence
- policy search
- real world