Login / Signup
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation.
Abhinav Jain
Vaibhav V. Unhelkar
Published in:
AAAI (2024)
Keyphrases
</>
stationary distribution
imitation learning
markov chain
random walk
queue length
transition probabilities
queueing networks
reinforcement learning
initial state
robotic systems
active learning
parameter estimation
steady state
humanoid robot
maximum margin