Zero-shot Preference Learning for Offline RL via Optimal Transport.

Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li
Published in: CoRR (2023)
Keyphrases
  • preference learning
  • reinforcement learning
  • Gaussian processes
  • multi-agent
  • closed form
  • dynamic programming
  • pairwise comparison
  • ordinal regression
  • data mining
  • objective function
  • active learning
  • semi-supervised