Student Subtyping via EM-Inverse Reinforcement Learning.
Xi YangGuojing ZhouMichelle TaubRoger AzevedoMin ChiPublished in: EDM (2020)
Keyphrases
- inverse reinforcement learning
- bayesian nonparametric
- partially observable environments
- preference elicitation
- expectation maximization
- mixture model
- learning process
- reward function
- probabilistic model
- em algorithm
- model selection
- artificial intelligence
- partial order
- dynamic systems
- gaussian process
- maximum likelihood
- np hard