Publication: Offline Reinforcement Learning with Policy Guidance and Uncertainty Estimation.