Sophisticated Swarm Reinforcement Learning by Incorporating Inverse Reinforcement Learning.
Yasuaki KuroeKenya TakeuchiPublished in: SMC (2023)
Keyphrases
- inverse reinforcement learning
- partially observable environments
- reinforcement learning
- reward function
- bayesian nonparametric
- temporal difference
- reinforcement learning algorithms
- preference elicitation
- particle swarm optimization
- partially observable
- markov decision processes
- model free
- machine learning
- state space
- approximate dynamic programming
- learning algorithm
- evolutionary algorithm
- optimal policy
- policy iteration
- graphical representation
- control policies
- dynamic programming
- decision making
- fuzzy logic