Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-Reinforcement Learning.
Kei AkuzawaYusuke IwasawaYutaka MatsuoPublished in: L4DC (2021)
Keyphrases
- hidden state
- reinforcement learning
- hidden markov models
- markov models
- dynamical systems
- partially observable
- state space
- fully observable
- belief revision
- markov model
- machine learning
- markov decision processes
- optimal control
- domain independent
- belief state
- belief space
- dynamic programming
- conditional random fields
- heuristic search
- pairwise
- multi agent