LobsDICE: Offline Imitation Learning from Observation via Stationary Distribution Correction Estimation.
Geon-Hyeong Kim
Jongmin Lee
Youngsoo Jang
Hongseok Yang
Kee-Eung Kim
Published in:
CoRR (2022)
Keyphrases
stationary distribution
imitation learning
markov chain
random walk
robotic systems
queueing networks
reinforcement learning
queue length
initial state
humanoid robot
transition probabilities
sufficient conditions
neural network
steady state
maximum margin
service times
multi modal