Login / Signup
Safe RLHF: Safe Reinforcement Learning from Human Feedback.
Josef Dai
Xuehai Pan
Ruiyang Sun
Jiaming Ji
Xinbo Xu
Mickel Liu
Yizhou Wang
Yaodong Yang
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
data mining
neural network
machine learning
artificial intelligence
e learning
state space
least squares
human interaction
action selection