Malthusian Reinforcement Learning.
Joel Z. Leibo, Julien Pérolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar A. Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel
Published in: CoRR (2018)
Keyphrases
- reinforcement learning
- function approximation
- machine learning
- temporal difference
- state space
- temporal difference learning
- reinforcement learning algorithms
- learning process
- control problems
- learning algorithm
- multi agent
- multi agent reinforcement learning
- optimal control
- dynamic programming
- database
- markov decision processes
- learning agents
- robot control
- optimal policy
- stochastic approximation
- evolutionary learning
- perceptual aliasing
- learning capabilities
- model free
- least squares
- active learning
- evolutionary algorithm
- search algorithm
- support vector
- artificial intelligence
- real world