Publication: SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization.