HIQL: Offline Goal-Conditioned RL with Latent States as Actions.
Seohong Park, Dibya Ghosh, Benjamin Eysenbach, Sergey Levine
Published in: NeurIPS (2023)
Keyphrases
- reinforcement learning
- goal state
- perceptual aliasing
- state action
- initial state
- action selection
- state space
- state transitions
- decision theoretic
- Markov decision processes
- action sequences
- partially observable domains
- agent learns
- state transition
- action space
- real time
- state information
- partial knowledge
- reward signal
- optimal policy
- multiagent reinforcement learning
- fully observable
- multi agent
- cognitive states
- partially observable
- model free
- plan recognition
- decision theoretic planning
- situation calculus
- machine learning