Internally Rewarded Reinforcement Learning.
Mengdi LiXufeng ZhaoJae Hee LeeCornelius WeberStefan WermterPublished in: CoRR (2023)
Keyphrases
- reinforcement learning
- function approximation
- machine learning
- reinforcement learning algorithms
- temporal difference
- learning algorithm
- robotic control
- state space
- temporal difference learning
- model free
- direct policy search
- evolutionary learning
- reinforcement learning methods
- action selection
- optimal policy
- dynamic programming
- multi agent
- artificial intelligence
- data sets
- markov decision processes
- learning capabilities
- function approximators
- relational reinforcement learning
- databases