Offline Reinforcement Learning with Uncertainty Critic Regularization Based on Density Estimation.
Chao LiFengge WuJunsuo ZhaoPublished in: IJCNN (2023)
Keyphrases
- density estimation
- reinforcement learning
- reproducing kernel hilbert space
- actor critic
- temporal difference
- reinforcement learning algorithms
- mixture model
- probability density
- outlier detection
- density function
- probability density function
- mixture modeling
- policy gradient
- em algorithm
- state space
- density estimators
- density estimates
- multivariate gaussian distribution
- parzen window
- kernel density estimation
- gaussian mixture model
- exponential family
- learning algorithm
- nonparametric density estimation
- data sets
- expectation maximization
- unsupervised learning