OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation.
Jongmin Lee
Wonseok Jeon
Byung-Jun Lee
Joelle Pineau
Kee-Eung Kim
Published in:
CoRR (2021)
Keyphrases
stationary distribution
state dependent
markov chain
random walk
product form
queueing networks
initial state
queue length
sufficient conditions
service times
transition probabilities
objective function
optimal policy
computational complexity
neural network model
queueing model
neural network
service rates