BSAC: Bayesian Strategy Network Based Soft Actor-Critic in Deep Reinforcement Learning.
Qin YangRamviyas ParasuramanPublished in: CoRR (2022)
Keyphrases
- actor critic
- reinforcement learning
- temporal difference
- policy gradient
- reinforcement learning algorithms
- approximate dynamic programming
- function approximation
- optimal control
- neuro fuzzy
- gradient method
- state space
- policy iteration
- policy gradient methods
- machine learning
- model free
- optimal policy
- supervised learning
- control problems
- rl algorithms
- bayesian networks
- learning algorithm
- control system
- multi agent
- neural network