Sign in

METREE: Max-Entropy Exploration with Random Encoding for Efficient RL with Human Preferences.

Isabel Y. N. GuanXin LiuGary ZhangEstella ZhaoZhenzhong Jia
Published in: ROBIO (2023)
Keyphrases
  • reinforcement learning
  • guided exploration
  • decision making
  • mutual information
  • computationally efficient
  • data sets
  • active learning
  • computationally expensive
  • human subjects