Timing and Partial Observability in the Dopamine System.
Nathaniel D. Daw, Aaron C. Courville, David S. Touretzky
Published in: NIPS (2002)
Keyphrases
- partial observability
- planning problems
- belief state
- belief space
- partially observable
- reinforcement learning
- partial information
- partially observable markov decision processes
- dynamic programming
- neural network
- learning algorithm
- state space
- knowledge acquisition
- degrees of freedom
- planning under partial observability