Keyphrases
- reinforcement learning
- batch mode
- function approximation
- state space
- batch size
- learning algorithm
- evolutionary algorithm
- optimal policy
- temporal difference learning
- control problems
- robotic control
- policy search
- stochastic approximation
- batch processing
- temporal difference
- multi-agent
- information retrieval
- robot control
- function approximators
- optimal control
- learning problems
- active learning
- quality prediction
- multi-agent reinforcement learning
- crowdsourcing
- transition model
- decision making
- machine learning