Theoretical foundations for programmatic reinforcement learning.
Guruprerana Shabadi, Nathanaël Fijalkow, Théo Matricon
Published in: CoRR (2024)
Keyphrases
- theoretical foundation
- reinforcement learning
- theoretical framework
- function approximation
- state space
- Markov decision processes
- action selection
- model free
- multi agent
- reinforcement learning algorithms
- Markov decision process
- control problems
- temporal difference
- temporal difference learning
- continuous state
- multi agent reinforcement learning
- policy search
- data sets
- machine learning
- neural network
- databases
- database
- transfer learning
- optimal policy
- information systems
- robot control
- information retrieval
- perceptual aliasing