Generalizable Policy Improvement via Reinforcement Sampling (Student Abstract).
Rui KongChenyang WuZongzhang ZhangPublished in: AAAI (2024)
Keyphrases
- reinforcement learning
- student learning
- learning environment
- learning styles
- high level
- optimal policy
- intelligent tutoring systems
- random sampling
- monte carlo
- learning process
- knowledge level
- asymptotically optimal
- policy makers
- neural network
- online learning
- student model
- markov chain monte carlo
- university level
- award winning