Keyphrases
- reinforcement learning
- function approximation
- connected components
- learning process
- machine learning
- learning algorithm
- cellular automata
- model free
- data sets
- state space
- optimal policy
- electromagnetic field
- underwater vehicles
- multi agent reinforcement learning
- temporal difference learning
- reinforcement learning algorithms
- temporal difference
- control algorithm
- markov decision processes
- dynamic environments
- dynamic programming