Mixed experience sampling for off-policy reinforcement learning.
Jiayu Yu, Jingyao Li, Shuai Lü, Shuai Han
Published in: Expert Syst. Appl. (2024)
Keyphrases
- reinforcement learning
- function approximation
- model free
- reinforcement learning algorithms
- random sampling
- data mining
- supervised learning
- user experience
- temporal difference learning
- learning curve
- temporal difference
- Monte Carlo
- state space
- multi agent
- learning algorithm
- real world
- Markov decision processes
- sample size
- sampling methods
- policy search