Achieving efficient interpretability of reinforcement learning via policy distillation and selective input gradient regularization.
Jinwei Xing
Takashi Nagata
Xinyun Zou
Emre Neftci
Jeffrey L. Krichmar
Published in:
Neural Networks (2023)
Keyphrases
supply chain
reinforcement learning
optimal policy
policy gradient
Markov decision process
policy search
machine learning
image processing
multiscale
state space
reinforcement learning algorithms
partially observable environments