Implicit Posteriori Parameter Distribution Optimization in Reinforcement Learning.
Tianyi Li, Genke Yang, Jian Chu
Published in: IEEE Trans. Cybern. (2024)
Keyphrases
- reinforcement learning
- optimization algorithm
- Markov decision processes
- learning algorithm
- global optimization
- optimization problems
- spatial distribution
- optimization method
- discrete optimization
- optimization process
- function approximation
- dynamic programming
- search algorithm
- transfer learning
- data distribution
- Markov chain
- state space
- parameter values
- optimization methods
- evolutionary algorithm
- learning process
- optimization model
- action selection