Foresight Distribution Adjustment for Off-policy Reinforcement Learning.
Ruifeng ChenXu-Hui LiuTian-Shuo LiuShengyi JiangFeng XuYang YuPublished in: AAMAS (2024)
Keyphrases
- reinforcement learning
- function approximation
- spatial distribution
- probability distribution
- state space
- model free
- multi agent
- reinforcement learning algorithms
- database
- uniformly distributed
- learning tasks
- markov decision processes
- data distribution
- markov chain
- dynamic programming
- decision trees
- website
- learning algorithm
- real time