Online Policy Learning from Offline Preferences.
Guoxi Zhang
Han Bao
Hisashi Kashima
Published in:
CoRR (2024)
Keyphrases
online learning
learning process
real time
learning algorithm
active learning
learning systems
reinforcement learning
incremental learning
online training