VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning.

Published in: NeurIPS (2023)

Keyphrases