Login / Signup
Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization.
Woosung Kim
Donghyeon Ki
Byung-Jun Lee
Published in:
AAAI (2024)
Keyphrases
</>
stationary distribution
state dependent
markov chain
random walk
queueing networks
product form
queue length
queueing model
optimal policy
initial state
service times
transition probabilities
service rates
sufficient conditions
optimal solution
steady state
wavelet transform
probabilistic model