
ORAD: a new framework of offline Reinforcement Learning with Q-value regularization.

Longfei Zhang, Yulong Zhang, Shixuan Liu, Li Chen, Xingxing Liang, Guangquan Cheng, Zhong Liu
Published in: Evol. Intell. (2024)
Keyphrases
  • reinforcement learning
  • main contribution
  • theoretical framework
  • data sets
  • real time
  • e learning
  • multi agent
  • state space
  • conceptual framework
  • kernel machines