Partially Observable Sequential Decision Making for Problem Selection in an Intelligent Tutoring System.
Emma BrunskillStuart J. RussellPublished in: EDM (2011)
Keyphrases
- partially observable
- sequential decision making
- decision problems
- reinforcement learning
- influence diagrams
- partial observability
- utility function
- state space
- markov decision processes
- partially observable environments
- optimal strategy
- optimal policy
- partial observations
- computational complexity
- temporal difference
- np hard
- reinforcement learning algorithms
- dynamical systems
- infinite horizon
- reward function
- machine learning
- belief state
- expected utility
- search algorithm
- semi supervised