Modeling Bellman-error with logistic distribution with applications in reinforcement learning.

Outongyi Lv Bingxin Zhou Lin F. Yang

Published in: Neural Networks (2024)

Keyphrases

reinforcement learning
learning algorithm
temporal difference learning
optimal policy
linear program
function approximation
error rate
state space
logistic regression
machine learning
error bounds
probability distribution
search algorithm
data distribution
spatial distribution
model free
action selection
error detection
reinforcement learning methods
extreme values
robotic control