An Online Spectral Learning Algorithm for Partially Observable Nonlinear Dynamical Systems.
Byron BootsGeoffrey J. GordonPublished in: AAAI (2011)
Keyphrases
- partially observable
- nonlinear dynamical systems
- dynamical systems
- learning algorithm
- reinforcement learning
- state space
- partial observability
- decision problems
- phase space
- online algorithms
- markov decision problems
- dynamic systems
- differential equations
- partial observations
- fixed point
- markov decision processes
- action models
- infinite horizon
- machine learning
- dynamic programming
- partially observable environments
- partially observable markov decision processes
- reward function
- learning rate
- optimal control
- orders of magnitude
- markov chain