Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning.

Published in: CoRR (2024)

Keyphrases