Keyphrases
- reinforcement learning
- Markov decision processes
- function approximation
- reinforcement learning algorithms
- state space
- learning algorithm
- sparse data
- reward function
- machine learning
- supervised learning
- temporal difference learning
- dynamic programming
- temporal difference
- model-free
- reward shaping
- optimal policy
- neural network
- sparse representation
- compressive sensing
- hidden state
- robotic control
- action selection
- optimal control
- partially observable
- action space
- sparse matrix
- multi-agent