Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation.

Chenyang Zhao, Timothy M. Hospedales

Published in: CoRR (2020)

Keyphrases

reinforcement learning
peer-to-peer
computationally efficient
complex domains
information systems
state space
domain specific
learning algorithm
digital libraries
evolutionary algorithm
semi-supervised
domain independent
Markov decision processes
function approximation
temporal difference
partially observable domains