Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation.
Dmitry Akimov
Published in: CoRR (2019)
Keyphrases
reinforcement learning
actor critic
knowledge base
average reward
policy gradient
neural network