Login / Signup

Value Cores for Inner and Outer Alignment: Simulating Personality Formation via Iterated Policy Selection and Preference Learning with Self-World Modeling Active Inference Agents.

Adam SafronZahra SheikhbahaeeNick J. HayJeff OrchardJesse Hoey
Published in: IWAI (2022)
Keyphrases
  • preference learning
  • multi agent
  • multi agent systems
  • ordinal regression
  • decision theoretic
  • decision making
  • recommender systems
  • pairwise comparison
  • pairwise
  • prior knowledge
  • ranking functions
  • bayesian inference