Learning with Delayed Payoffs in Population Games using Kullback-Leibler Divergence Regularization.

Shinkyu Park, Naomi Ehrich Leonard

Published in: CoRR (2023)

Keyphrases

Kullback-Leibler divergence
learning algorithm
reinforcement learning
prior knowledge
pairwise
unsupervised learning
information theoretic
game theory
active learning
supervised learning
Markov random field
information theory