Login / Signup
SaFormer: A Conditional Sequence Modeling Approach to Offline Safe Reinforcement Learning.
Qin Zhang
Linrui Zhang
Haoran Xu
Li Shen
Bowen Wang
Yongzhe Chang
Xueqian Wang
Bo Yuan
Dacheng Tao
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
multi agent
supervised learning
real time
real world
petri net
function approximation
modeling method
modeling framework
hidden state