HIQL: Offline Goal-Conditioned RL with Latent States as Actions.

Seohong Park Dibya Ghosh Benjamin Eysenbach Sergey Levine

Published in: NeurIPS (2023)

Keyphrases

reinforcement learning
goal state
perceptual aliasing
state action
initial state
action selection
state space
state transitions
decision theoretic
markov decision processes
action sequences
partially observable domains
agent learns
state transition
action space
real time
state information
partial knowledge
reward signal
optimal policy
multiagent reinforcement learning
fully observable
multi agent
cognitive states
partially observable
model free
plan recognition
decision theoretic planning
situation calculus
machine learning