S-REINFORCE: A Neuro-Symbolic Policy Gradient Approach for Interpretable Reinforcement Learning.
Rajdeep DuttaQincheng WangAnkur SinghDhruv KumarjigudaXiaoli LiSenthilnath JayaveluPublished in: CoRR (2023)
Keyphrases
- policy gradient
- reinforcement learning
- actor critic
- reinforcement learning algorithms
- function approximation
- optimal control
- policy search
- gradient method
- neuro fuzzy
- state space
- model free reinforcement learning
- neural network
- policy gradient methods
- reinforcement learning methods
- model free
- learning algorithm
- artificial neural networks
- state action
- approximation methods
- function approximators
- variance reduction
- approximate dynamic programming
- control problems
- markov decision processes
- learning problems
- monte carlo
- radial basis function
- partially observable markov decision processes
- cost function
- multi agent systems
- natural actor critic