Keyphrases
- gradient ascent
- bandit problems
- active exploration
- decision problems
- active learning
- small sample
- problem-based learning
- cross entropy
- expectation maximization
- exponential family
- computational complexity
- feature vectors
- supervised learning
- sufficient conditions
- objective function
- game playing
- policy gradient
- artificial neural networks
- multi-agent systems