Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation.
Chenyang ZhaoTimothy M. HospedalesPublished in: CoRR (2020)
Keyphrases
- reinforcement learning
- peer to peer
- computationally efficient
- complex domains
- information systems
- state space
- domain specific
- learning algorithm
- digital libraries
- evolutionary algorithm
- semi supervised
- domain independent
- markov decision processes
- function approximation
- temporal difference
- partially observable domains