Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning.
Haichao ZhangWei XuHaonan YuPublished in: CoRR (2022)
Keyphrases
- reinforcement learning
- action selection
- exploration strategy
- active exploration
- multiple robots
- multi agent
- model based reinforcement learning
- macro actions
- complex domains
- stochastic domains
- planning problems
- temporal information
- function approximation
- generative model
- cooperative
- partially observable
- deterministic domains
- markov decision processes
- decision support
- exploration exploitation
- autonomous learning
- spatio temporal
- optimal policy
- heuristic search
- unsupervised learning
- learning process
- temporal difference
- data driven
- reinforcement learning problems
- partial observability
- reinforcement learning methods