Mutual Reinforcement Learning.
Sayanti Roy, Emily Kieson, Charles Abramson, Christopher Crick
Published in: CoRR (2019)
Keyphrases
- reinforcement learning
- function approximation
- model free
- state space
- optimal control
- learning algorithm
- optimal policy
- reinforcement learning algorithms
- temporal difference
- databases
- website
- learning process
- control problems
- Markov decision processes
- robotic control
- Markov decision process
- transfer learning
- dynamic programming
- hidden Markov models
- artificial intelligence