Adaptable Agent Populations via a Generative Model of Policies.
Kenneth DerekPhillip IsolaPublished in: NeurIPS (2021)
Keyphrases
- generative model
- reward function
- probabilistic model
- multi agent
- bayesian framework
- prior knowledge
- multiagent systems
- discriminative learning
- em algorithm
- latent dirichlet allocation
- multi agent systems
- semi supervised
- multiple agents
- topic models
- discriminative models
- posterior probability
- optimal policy
- markov decision process
- expectation maximization
- hierarchical bayesian model
- feature space