A Novel Heuristic Exploration Method Based on Action Effectiveness Constraints to Relieve Loop Enhancement Effect in Reinforcement Learning with Sparse Rewards.
Zhenghongyuan NiYe JinPeng LiuWei ZhaoPublished in: Cogn. Comput. (2024)
Keyphrases
- reinforcement learning
- high accuracy
- dynamic programming
- detection method
- transition model
- exploration strategy
- heuristic methods
- segmentation method
- cost function
- high dimensional
- computational complexity
- search algorithm
- pairwise
- preprocessing
- clustering method
- multi agent
- markov decision processes
- search procedure
- constrained optimization
- image processing
- action selection
- learning algorithm
- machine learning