Theoretical foundations for programmatic reinforcement learning.

Guruprerana Shabadi Nathanaël Fijalkow Théo Matricon

Published in: CoRR (2024)

Keyphrases

theoretical foundation
reinforcement learning
theoretical framework
function approximation
state space
markov decision processes
action selection
model free
multi agent
reinforcement learning algorithms
markov decision process
control problems
temporal difference
temporal difference learning
continuous state
multi agent reinforcement learning
policy search
data sets
machine learning
neural network
databases
database
transfer learning
optimal policy
information systems
robot control
information retrieval
perceptual aliasing