Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation.

Yu Chen Xiangcheng Zhang Siwei Wang Longbo Huang

Published in: CoRR (2024)

Keyphrases

function approximation
reinforcement learning
risk sensitive
model free
optimal control
markov decision processes
temporal difference
function approximators
reinforcement learning algorithms
temporal difference learning
learning tasks
machine learning
learning algorithm
artificial neural networks
decision theoretic
radial basis function
utility function
action selection
neural network
policy iteration
multi agent
control policies