Value Cores for Inner and Outer Alignment: Simulating Personality Formation via Iterated Policy Selection and Preference Learning with Self-World Modeling Active Inference Agents.
Adam SafronZahra SheikhbahaeeNick J. HayJeff OrchardJesse HoeyPublished in: IWAI (2022)