Login / Signup
Efficient RLHF: Reducing the Memory Usage of PPO.
Michael Santacroce
Yadong Lu
Han Yu
Yuanzhi Li
Yelong Shen
Published in:
CoRR (2023)
Keyphrases
</>
memory usage
memory footprint
computationally expensive
memory requirements
database
artificial intelligence
information systems
neural network
machine learning
computer vision
similarity measure