Modeling Bellman-error with logistic distribution with applications in reinforcement learning.
Outongyi LvBingxin ZhouLin F. YangPublished in: Neural Networks (2024)
Keyphrases
- reinforcement learning
- learning algorithm
- temporal difference learning
- optimal policy
- linear program
- function approximation
- error rate
- state space
- logistic regression
- machine learning
- error bounds
- probability distribution
- search algorithm
- data distribution
- spatial distribution
- model free
- action selection
- error detection
- reinforcement learning methods
- extreme values
- robotic control