Pareto-Optimal Learning from Preferences with Hidden Context.

Ryan Boldi, Li Ding, Lee Spector, Scott Niekum
Published in: CoRR (2024)
Keyphrases
  • Pareto optimal
  • learning process
  • learning algorithm
  • decision making
  • reinforcement learning
  • NP-complete
  • resource allocation
  • multi-objective optimization
  • multi-criteria