A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments.
Sherif M. AbdelfattahKathryn E. KasmarikJiankun HuPublished in: CoRR (2023)
Keyphrases
- multi objective
- optimization algorithm
- learning algorithm
- computationally efficient
- reinforcement learning
- preprocessing
- detection algorithm
- objective function
- parameter tuning
- dynamic programming
- particle swarm optimization
- model free
- simulated annealing
- k means
- optimal solution
- machine learning
- search space
- matching algorithm
- computational complexity
- expectation maximization
- state space
- optimal policy
- benchmark problems
- function approximation
- test problems
- function approximators
- policy search