Covering Number as a Complexity Measure for POMDP Planning and Learning.
Zongzhang ZhangMichael L. LittmanXiaoping ChenPublished in: AAAI (2012)
Keyphrases
- reinforcement learning
- learning process
- machine learning
- small number
- heuristic search
- complexity measures
- state space
- partially observable
- action selection
- planning problems
- domain independent
- learning tasks
- partially observable markov decision process
- memory requirements
- learning systems
- collaborative learning
- supervised learning
- active learning
- multi agent