Online Policy Learning from Offline Preferences.

Guoxi Zhang, Han Bao, Hisashi Kashima
Published in: CoRR (2024)
Keyphrases
  • online learning
  • learning process
  • real time
  • learning algorithm
  • active learning
  • learning systems
  • reinforcement learning
  • incremental learning
  • online training