Keyphrases
- density estimation
- policy search
- reinforcement learning
- mixture model
- continuous state
- dynamic programming
- probability density function
- outlier detection
- policy gradient
- reinforcement learning algorithms
- gaussian mixture model
- language model
- data mining
- reward function
- partially observable markov decision processes
- markov decision problems
- em algorithm
- expectation maximization
- image segmentation
- machine learning